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Preface 


Group theory was discovered by Evariste Galois in the 19th century 
for the case of finite dimensional symmetric group. It has been suc- 
cessfully generalized subsequently for the infinite dimensional case 
(i.e. Lie group and Lie algebra) by Sophus Lie. Before the advent 
of quantum mechanics in the early 20th century, group theory was 
thought by many physicists to be unimportant in the study of physics. 
It is indeed of some interest to note the following anecdote narrated 
by Freeman Dyson (as quoted in Mathematical Apocrypha, p 21, by 
S. A. Kranz), “In 1910, the mathematician Oswald Veblen (1880- 
1960) - a founding member of the Institute for Advanced Study - and 
the physicist James Jean (1877-1946) were discussing the reform of 
the mathematics curriculum at Princeton University. Jeans argued 
that they ‘may as well cut out group theory,’ for it ‘would never be of 
any use to physics’.” The real fundamental change in thinking truly 
occurred with the development of quantum mechanics. It was soon 
realized that a deep knowledge of group theory and Lie algebra in the 
study of of angular momentum algebra is crucial for real understand- 
ing of quantum mechanical atomic and nuclear spectral problems. At 
present, group theory permeates problems in practically every branch 
of modern physics. Especially in the study of Yang-Mills gauge theory 
and string theory the use of group theory is essential. For example, we 
note that the largest exceptional Lie algebra Eg appears in heterotic 
string theory and also in some one-dimensional Ising model in statis- 
tical mechanics, some of whose predictions have been experimentally 
verified recently. 

Our goal in this monograph is to acquaint (mostly) graduate 
students of physics with various aspects of modern Lie group and 
Lie algebra. With this in view, we have kept the presentation of 
the material in this book at a pedagogical level avoiding unnecessary 
mathematical rigor. Furthermore, the groups which we will discuss 
in this book will be mostly Lie groups which are infinite dimensional. 
We will discuss finite groups only to the extent that they will be nec- 
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essary for the development of our discussions. The interested readers 
on this topic are advised to consult many excellent text books on 
the subject. We have not tried to be exhaustive in the references. 
Rather, we have given only a few references to books (at the end of 
each chapter) that will be easier to read for a student with a physics 
background. We assume that the readers are familiar with the mate- 
rial of at least a two semester graduate course on quantum mechanics 
as well as with the basics of linear algebra theory. 

We would like to thank Dr. Brenno Ferraz de Oliveira (Braz- 
ingd) for drawing all the Young tableau diagrams in chapter 5. 


A. Das and S. Okubo 
Rochester 
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CHAPTER 1 


Introduction to groups 


In this chapter, we introduce the concept of a group and present 
in some detail various examples of commonly used groups in physics. 
This is helpful in establishing the terminology as well as the notations 
commonly used in the study of groups which will also be useful in 
further development of various ideas associated with groups. 


1.1 Definition of a group 
Let us start with the formal definition of a group G as follows: 


(G1): For any two elements a and 6 in a group, a product is defined 
in G satisfying 


ab=ceEG, Va,beG. (1.1) 


(G2): The group product is associative so that 


(ab)c = a(bc)(= abc), Va,b,c € G. (1.2) 


(G3): The group has a unique identity (unit) element e € G such 
that 


ea=ae=a, VaeG. (1.3) 
This implies that 


eS eeG. (1.4) 


(G4): Any element a € G has a unique inverse element a~! € G so 
that 


ag =a ‘a =e. (1.5) 
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Any set of elements G satisfying all the axioms (G1)-(G4) is defined 
to be a group. On the other hand, a set of elements which satisfies 
only the first three axioms (G'1)-(G3), but not (G4), is known as a 
semi-group. (More rigorously, a semi-group is defined as the set of 
elements which satisfy only (G1)-(G2). However, one can always add 
the identity element to the group since its presence, when an inverse 
is not defined, is inconsequential (see (1.3)-(1.4)) and we will adopt 
this definition commonly used in physics.) 

Some comments are in order here. We note that the definition 
of a group does not require that the product rule satisfy the commu- 
tativity law ab = ba. However, if for any two arbitrary elements of 
the group, a,b € G, the product satisfies ab = ba, then the group G 
is called a commutative group or an Abelian group (named after the 
Norwegian mathematician Niels Henrik Abel). On the other hand, if 
the product rule for a group G does not satisfy commutativity law 
in general, namely, if ab 4 ba for some of the elements a,b € G, then 
the group G is known as a non-commutative group or a non-Abelian 
group. Furthermore, it is easy to see from the definition (G4) of the 
inverse of an element that the inverse of a product of two elements 
satisfies 


(ab) +=b a1 Aa, (1.6) 


in general, unless, of course, the group G is Abelian. Equation (1.6) 
is easily checked in the following way 


(ab)(b~*a~*) = a(b(b-*a~")), 


1.2) with e= b71a“!, 


by ( 
= a((e")a-4), by (1.2), 
= a(ea—'), by (1.5), 
— aa; by (1. 3), 
=e, by (1.5). (1.7) 


Similarly, it is straightforward to verify that (b~!a~!)(ab) = 

In order to illustrate the proper definitions in a simple manner, 
let us consider the following practical example from our day to day 
life. Let “a” and “b” denote respectively the operations of putting 
on a coat and a shirt. In this case, the (combined) operation “ab” 
would correspond to putting on a shirt first (b) and then putting 
on a coat (a) whereas the (combined) operation “ba” would denote 
putting on a coat first and then a shirt. Clearly, the operations are 
not commutative, namely, ab ~ ba. If we now introduce a third 


1.2 EXAMPLES OF COMMONLY USED GROUPS IN PHYSICS 3 


669) 


operation “c” as corresponding to putting on an overcoat, then the 
law of associativity of the operations (1.2) follows, namely, (ca)b = 
c(ab) = cab and corresponds to putting on a shirt, a coat and an 
overcoat in that order. It now follows that the operation “bb” denotes 
putting on two shirts while “b(bb) = (bb)b = bbb” stands for putting 
on three shirts etc. The set of these operations would define a semi- 
group if we introduce the identity (unit) element e (see (1.3)) to 
correspond to the operation of doing nothing. However, this does 
not make the set of operations a group for the following reason. We 
note that we can naturally define the inverses “a~!” and “b~!” to 
correspond respectively to the operations of taking off a coat and a 
shirt. It follows then that a~ta = e = b~!b as required. However, we 
also note that the operation “aa~'” is not well defined, in general, 
unless there is already a coat on the body and, therefore, aa~! F e, 
in general (but a~'a = e always). As a result, the set of operations 
so defined do not form a group. Nonetheless, (1.6) continues to be 
valid, namely, (ab)~! = b-'a~! 4 a~'b~!, whenever these operations 
are well defined. In fact, since ab corresponds to putting on a shirt 
and then a coat, if we are already dressed in this manner, the inverse 
(ab)~! would correspond to removing these clothes (come back to 
the state prior to the operation ab) which can only be done if we first 
take off the coat and then the shirt. This leads to (ab)~! = b-'a7t. 
Furthermore, let us note that if “d” denotes the operation of putting 
on a trouser, then the operation of putting on a shirt and a trouser 
are commutative, namely, db = bd and so on. 


1.2 Examples of commonly used groups in physics 


In this section, we would discuss some of the most commonly used 
groups in physics. This would also help to set up the conventional 
notations associated with such groups. 


1.2.1 Symmetric group Sy. Although most of the commonly used 
groups in physics are infinite dimensional (Lie) groups, let us begin 
with a finite dimensional group for simplicity. Let us consider a set 
of N distinct objects labelled by {2;},2 = 1,2,---,N and consider 
all possible distinct arrangements or permutations of these elements. 
As we know, there will be N! possible distinct arrangements (per- 
mutations) associated with such a system and all such permutations 
(operations) form a group known as the symmetric group or the per- 
mutation group of NV objects and is denoted by Sy. Since the number 
of elements in the group is finite, such a group is known as a finite 
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group and the number of elements of the group (n!) is known as the 
order of the group Sy. Permutation groups are relevant in some 
branches of physics as well as in such diverse topics as the study of 
the Rubik’s cube. Note that every twist of the Rubik’s cube is a 
rearrangement of the faces (squares) on the cube and this is how the 
permutation group enters into the study of this system. There are 
various possible notations to denote the permutations of the objects 
in aset, but let us choose the most commonly used notation known as 
the cycle notation. Here we denote by (77) the operation of permuting 
the objects x; and x; in the set of the form 


Gs 23 Say gH 1s NG (1.8) 


with all other objects left unchanged. In fact, the proper way to read 
the relation (1.8) is as 


igs. pees SS as (1.9) 


This brings out the cyclic structure of the cycle notation (parenthesis) 
and is quite useful when the permutation of more objects are involved. 
It is clear from this cyclic structure that under a cyclic permutation 


(17) = (9%), (1.10) 
and when permutations of three objects are involved we have 
(ijk) : Li Lj > Lk > Xi, (1.11) 


and 


(ijk) = (ki) = (ght) (A Gik) = (kjt) = (tkj)). (1.12) 


Let us clarify the group structure of Sy in the case of N = 3, 
namely, let us consider the set of three objects (elements) {z;} = 
(x1, 22,23) where i = 1,2,3. With the cycle notation we note that 
the 3! distinct permutations of these elements can be denoted by the 
set of operations {P} with the elements in the set given by 


Pip=e= (1), Pe = (12), P3 = (23), Pa = (13), 
Py = (123); Pe = (132), (1.13) 


where we have introduced the identity element P,; = e = (1) to 
correspond to doing no permutation (every object is left unchanged 
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under this operation). The product rule for the permutations can 
now be understood in the sense of operations. For example, 


P»P3 = (12)(23), (1.14) 


implies carrying out the permutation (23) first followed by the per- 
mutation (12). We note that under this combined operation 


(23) (12) 
Ly — L1 — 2X2, 


(23) (12) 
LQ —> ©3 — £3, 


X3 Be) ee (1.15) 


so that the combined operation is equivalent to 
P)P3 = (12)(23) = (123) = Ps € {P}, (1.16) 


and the group is closed under multiplication (see (1.1)). Physically 
this can be understood from the fact that since {P} represents all 
possible distinct arrangements of the objects in the set, any two ar- 
rangements would lead to an element of the set. Following (1.15), we 
can also show that 


PsP) = (23)(12) = (182) = Ps € {P}, (1.17) 


which shows that the product rule for the permutations is not com- 
mutative 


PoP; # P3Pp. (1.18) 


The associativity of the product of permutations (1.2) can also 
be easily verified. For example, we note that 


(P2P3)P, = P5.P, = (123)(13) = (23) = Ps, 

Ps(P3P1) = (12)((23)(13)) = (12)(123) = (23) = Ps, (1.19) 
so that associativity holds and we have 

(P2P3)P4 = P2(P3P4) = P2P3Py4. (1.20) 


The identity element P, = e of the group corresponding to no per- 
mutation is clearly unique and satisfies (see (1.3)) 


P,P, = P,{Py = Py, o=1,2,---,6. (1.21) 
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Finally, the inverse of the permutations is determined easily as 
follows. For any pair permutation (77), we see that doing the permu- 
tation twice brings it back to the original arrangement of the objects. 
Therefore, we have 


PP» P3 P32 PyP4 e Py, (1.22) 
and we can identify (identity is its own inverse, (P,)~! = P,) 
(Po)"* = Pa, (P3)"h = Ps, (Pa)7) = Pa. (1.23) 


On the other hand, for permutations involving all three objects we 
see that (we are using (1.16) and (1.17) as well as (1.22)) 


Pip = Pala = le =P), (1.24) 


which determines that Ps is the inverse of Ps and vice versa. This 
shows that S3 indeed satisfies all the postulates of a group and a 
similar proof can be extended to the case of Sy as well. 

After the discussion of this simple finite dimensional group, let 
us next move onto infinite dimensional (Lie) groups (named after the 
Norwegian mathematician Sophus Lie) which are more commonly 
used in the study of modern physics. 


1.2.2 One dimensional translation group T,. Let x denote a real vari- 
able (namely, « € R and it can be either a spatial coordinate or the 
time coordinate, for example). For any real constant parameter a, 
we define an operation T(a) which acting on the coordinate x simply 
translates it by an amount a such that 


T(a)jt=ax2+a, VaeeR. (1.25) 


This is to be interpreted in one of the following two ways. First, we 
can think of T(a) as the active transformation 


T(a) 


T(a): x2— >a =2r+a. (1.26) 


Alternatively, we can interpret T(a) as a linear operator satisfying 
(1.25) with the action of T(a) leaving any constant parameter b un- 
changed (a linear operator does not act on constants, only on vectors), 
namely, 


T(a)b=b. (1.27) 
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In either case, the combined effect of two translations can now 
be determined. Viewed as an active transformation, for example, we 
can write 


T(b) 


T(a)T(b): 2—>2'=2+5, followed by 


1 Ti) on 


ge" =27'+a=24+b+4, (1.28) 
which, in the notation of (1.25), corresponds to 
(T(a)T(b)) # = T(a) (T()2) = (T()a) +a=a+b+a 
=24+(b+a)=T(b+a)rz =T(a+ dja. (1.29) 


One can represent the transformations in (1.28) diagrammatically as 
shown in Fig. 1.1. On the other hand, considered in terms of linear 
operators, it is easy to check with the use of (1.27) that 


T(b) T(a) 
———————— ee 
xL b va x" 


Figure 1.1: The combined operation of two translations T(a)T(b) 
where a translation T(b) is followed by another translation T(a). 


(T(a)T(b)) 2 = T(a) (T()x) = T(a) (w +b) = Tax + T(a)b 
=xrt+a+b=T(a+b)x=T(b+a)z, (1.30) 


which coincides with (1.29). 
We note that we may identify T(a) with the differential operator 


d 
T =l+a— 1.31 
(a) © ee ( 31) 


acting on x so that 


T(a)x = (ital )e=eta, 


T(a)b= (1 +a =) b=. (1.32) 
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Here 1 denotes the identity operator. It follows now that 


(1 as] (1 | be )e 
- (1+ +95 +a s)e 


daz da? 
= (1 (ated) =) z=T(a+)z, (1.33) 


consistent with (1.30) (or (1.29)). 

Equations (1.29) and (1.30) show that the product T(a)T(b), 
the combined operation of two translations, is well defined and cor- 
responds also to a translation, namely, 


T(a)T(b) =T(b+a) =T(a+b) =T(b)T(a), Va,bER. (1.34) 
This defines the product rule for the operation (see (1.1)) which, in 
this case, is commutative. The associativity of the product (1.2) is 


easily verified from 


(T(a)T(b))T(c) = T(a + b)T(c) = T((a +b) +c) 
=T(a+b-+c), 

T(a)(T(b)T(c)) = T(a)T(b+c) =T(a+(b+0)) 
=T(a+b+o), (1.35) 


where (1.34) has been used and (1.35) leads to 
(T(a)T(b)) T(c) = T(a) (T(b)T(c)) (1.36) 
The unit element e (see (1.3)) in this case can be identified with 
e=T(0), (1.37) 
so that it satisfies 
eT (a) =T(0)T(a) = T(a) = T(a)T(0) = T(a)e. (1.38) 


Finally, the inverse of a translation T(a) by an amount a can be 
defined as a translation by an amount (—a) 


(T(a))~* = T(—a), (1.39) 
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so that (see (1.5)) 


= T(—a)T(a) = (T(a))"' T(a). (1.40) 


Thus, we see that the set of translations T; = {T(a)} with the 
parameter of translation a € R (namely, —oo < a < co) satisfy all 
the properties (G1)-(G4) defining a group. This is a continuous group 
since the parameter of translation can take any continuous value in R 
and is known as the one dimensional translation group. Furthermore, 
since the product rule for this group is commutative (see (1.34)), this 
is an example of an Abelian group. To conclude this subsection, we 
note that if the parameters of translation were restricted to a finite 
interval Ly <a < Lg, then the set of translations will not define a 
group since two successive translations may not remain in the interval. 


1.2.3 One dimensional unitary group U(1). Let z be a complex variable 
and let us consider the phase transformation 
z> 2 =e z, (1.41) 


where @ is a real constant angular parameter 0 < 0 < 27. We can 
write the phase transformation (1.41) in terms of operators also as 
(see also (1.25)) 


U(0)z =e” z. (1.42) 


Following the discussion in section 1.2.2 we can show from (1.41) (or 
(1.42)) that the product 


U(01)U (62) = U(A1 + 62) = U(A2)U (61), (1.43) 


is well defined if we identify U(@ + 27) = U(@) (since e27* = 1) 
whenever 6, + 62 = 9+ 27 > 27. We note that in this case we can 
identify (t denotes the Hermitian conjugate, in this one dimensional 
case, only the complex conjugate) 


U'(6) =U-1(6) = U(-6), (1.44) 
namely, 
Ul (0)z =U—-1(0)z = U(-0)z =e z, (1.45) 


so that the operator U(@) is unitary 
U(0)U'(6) =e =UT(6)U(8). (1.46) 
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The set of phase transformations U(1) = {U(@), 0 < 6 < 2m} 
forms a group called the one dimensional unitary group U(1) (also 
written as U,). We see from (1.43) that it is a commtative group. 
Let us also note that we can restrict the range of the parameter of 
transformation to 0 < @ < 2m since e27’ = 1 and this has to be 
contrasted with the parameters of translations TJ, discussed earlier. 
Correspondingly, U(1) is known as a compact group (parameters have 
a compact range) as opposed to TJ; which is called a non-compact 
group. We will discuss these concepts in more detail in section 1.3. 


1.2.4 U(N) and SL(N) groups. We can generalize the group of one 
dimensional phase transformations U(1) discussed in section 1.2.3 to 
N dimensions as follows. Let A denote a N x N unitary matrix, 
namely, 


Q11 12 Q1N 
G21 G22 +++ Gon 
A= ; 5 (1.47) 
QN1 ON2 ‘+: ANN 
where the matrix elements aj;;, 7,7 = 1,2,--- , N are complex param- 


eters such that 
AAt = Ey = A'A, (1.48) 


Here Ey (= 1y) denotes the N x N unit (identity) matrix so that 
the unitarity condition (1.48) leads to the condition 


N N 
k=1 k=1 


where 6;; denotes the familiar Kronecker delta. For i = 7, equation 
(1.49) yields 


N N 
Pe eee (1.50) 
k=l k=l 


which, in turn, implies that 
loux| = 1; i,k =1,2,--- piv. (1.51) 
Let us further note that 


det AT = (det A?)" = (det A)*, det Ey = 1, (1.52) 
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where (',7,*) denote respectively Hermitian conjugate, transpose 
and complex conjugate of the matrix. With these, we can take the 
determinant of both sides in (1.48) to obtain 


det A (det A)* = | det Al? = 1, 
or, |det A] =1, (1.53) 


so that A~! always exists and can be identified, as in (1.44), with 
(see (1.48)) 


At=A", (1.54) 


and it follows that A7! is also unitary. For example, 
a 
A? (4-1) = at (4') = AtA= En, (1.55) 


which follows from (1.48). Similarly, if A and B are both unitary 
matrices, then their matrix product AB also is. Finally, the N x N 
unit (identity) matrix Ey is unitary. Thus, we conclude that the set 
of all N x N unitary matrices U(N) = {A} form a group known as 
the unitary group U(N) (or Uy). 

In a similar manner, if we consider all N x N matrices A satis- 
fying only the condition 


det A = 1, (1.56) 


we can show that they also form a group. From the matrix prod- 
uct, it follows that, for any two matrices satisfying (1.56), we have 
det(AB) = det Adet B = 1 since each of the matrices has unit de- 
terminant, namely, det A = 1 = det B. Furthermore, we note that 
det A~! = (det A)~! = 1 and det Ey = 1. (Note that since the ma- 
trices have unit determinant, the inverses exist.) This group is known 
as the special linear group in N dimensions and is denoted by SL(N) 
(or SLy). If the set of N x N matrices {A}, in addition to satis- 
fying (1.56), also satisfy the unitary condition (1.48), the resulting 
group is known as the special unitary group in N dimensions and is 
denoted by SU(N) (or SUy) which is a subgroup of both SZy and 
Uy. (The notion of a subgroup will be discussed later.) For N > 2, 
these groups are not Abelian (non-Abelian). 

The U(N) group is of interest in the operator algebra of har- 
monic oscillators in quantum mechanics. Let a; and al with 1,7 = 
1,2,--- , N denote respectively the annihilation and the creation op- 
erators for a N dimensional oscillator system satisfying 


[a;,a;]=0=[al,al], [a:,al] = 4. (1.57) 
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Here [a, b] = ab — ba denotes the commutator of two linear operators 
a and b. Let us next consider the linear canonical transformation of 
the annihilation and the creation operators in (1.57) of the form 


N 
a; —> b; = So ajay, 
j=l 
N 
al > bl = So alat,, (1.58) 
j=l 


where a;; denote constant parameters. It follows now that if the 
parameters aj; satisfy the unitarity condition (1.49), the transformed 
operators satisfy 


[b35 bj) =0'= (BBG. [o,0!] =o, (1.59) 


for i,j = 1,2,---,N. The last relation in (1.59) (which is the only 
one that needs checking), for example, can be seen as 


N N N 
[bi, bt] = ba oun Yee = > ana (ag, a] 


k=1 l=1 kl=1 
N N 
ok ok 
= ) ip 0OKe = ) UkAje = Oi; (1.60) 
kl=1 k=1 


where we have used (1.57) in the intermediate step as well as (1.49) 
in the last step. 

Since the N x N matrix A defined by (1.48) (or equivalently 
by (1.49)) is unitary, we conclude that the canonical commutation 
relations (1.57) for the creation and annihilation operators of the 
harmonic oscillator are invariant under the unitary group of transfor- 
mations U(N). More importantly, the Hamiltonian operator (normal 
ordered although the presence of a zero point energy does not modify 
the result) for the harmonic oscillator system defined by 


N 
a=: ala;, (1.61) 
i=1 


is also invariant under U(N). This is easily checked from the fact 


1.2 EXAMPLES OF COMMONLY USED GROUPS IN PHYSICS 13 


that (see (1.58)) 


N N N N 
Sitis=¥ (Sales } (Seow) 
i=1 i=1 a=1, k=l 
N N 
= oS alan > ainae, 
j,k=1 i=1 
N N 
= \~ ala,dj, = Sala; = H, (1.62) 
j,k=1 j=l 


where we have used (1.49) in the intermediate step. 

The results in (1.59) and (1.62) can be seen more easily as fol- 
lows. Let us arrange the two sets of N annihilation operators into 
column matrices as 


ay by 
a2 ee be 

wt en ae oe eae (1.63) 
an bn 


Correspondingly the two sets of creation operators a and bt would 
be denoted by row matrices. With this notation, the transformation 
(1.58) can be written in the matrix form (see also (1.47)) 


b= Aa, bt =alal, (1.64) 


and the relations (1.59) and (1.62) follow if the transformation matrix 
A is unitary, namely, if it satisfies (1.48). For example, we note 
that (the summation over all modes is automatically contained in 
the matrix multiplication) 


bb = al At AG = G14 = ala = H, (1.65) 


The invariance of the Hamiltonian of a dynamical system under a 
transformation defines a symmetry of the system as we will discuss 
later. 


1.2.5 O(N) and SO(N) groups. In the earlier example of the unitary 
group studied in section 1.2.4 we assumed the N x N matrix A to be 
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a complex unitary matrix. On the other hand, if we restrict further 
the matrix A to be a real matrix, namely, 


Pe we (1.66) 
then (1.48) leads to 
AA? = Ey = A’ A, (1.67) 


which follows from the fact that At = (A*)? = A? for real matrices. 
Here A’ denotes the transpose of the matrix A, namely, 


(A*) ig = Aja, (1.68) 


so that explicitly we can write (see (1.47)) 


11 O21 ON1 

T O12 O22 °°" aN2 
7s ce Hts (1.69) 

Q1N A2N *"* ANN 


We note that the restriction on the matrix elements following from 
(1.67) has the form 


N N 
S ainajn = bij = D> anians; (1.70) 
k=l k=l 


which can be compared with (1.49) (for real elements, aj; = aj;). In 
particular, for i = 7, relation (1.70) leads to 
N N 
Gay Hi Sy Gays PSU, (1.71) 
k=1 k=1 


so that the elements of the matrix A are restricted to have values 
—1 < aij = lfor any 7,7 = 1,2,-- ,N.- 

We can readily verify as in the last section that the set of all N x 
N real matrices satisfying (1.67) forms a group which is called the real 
orthogonal group in N dimensions and is denoted by O(N) (or On) 
(the matrices A satisfying (1.67) are known as orthogonal matrices). 
Furthermore, taking the determinant of both sides of (1.67), we note 
that since det A? = det A, we have 


(det A)? = 1, 
or, detA=+1. (1.72) 
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If we require the orthogonal matrices A to satisfy the additional con- 
dition 


det A=1, (1.73) 


then it can be shown that all such matrices define the real special 
orthogonal group in N dimensions which is denoted by SO(N) (or 
SOn). Sometimes it is also known as the group of rotations in N 
dimensions identified as Ry. This nomenclature can be understood as 
follows. Let us consider the N dimensional Euclidean space labelled 
by the coordinates 


Sel Gs (1.74) 
UN 


Rotations define the maximal symmetry (which leaves the origin in- 
variant) of such a space and the length of a vector in this space defined 
as 


x?x =a? +a2+--.+ a4, (1.75) 


is invariant under rotations. Let us note that if we define rotations 
through N x N matrices R as 


xx’ = Rx, (1.76) 
it follows from (1.75) that the length remains invariant, namely, 
x!? x! = x? RU Rx =x! x, 
Of. (2, PG Peay Sa +a eae, 7) 
only if 
R'R= Ey. (1.78) 


Comparing (1.78) with (1.67) we recognize that the rotation matrices 
Rin N dimensional Euclidean space correspond to orthogonal ma- 
trices and correspondingly the O(N) group is also identified with the 
rotation group Ry. The difference between the O(N) and the SO(N) 
groups lies in the fact that the latter group does not include “mirror 
reflections” (which render the matrices A to have det A = —1). (Note 
that the length of a vector defined in (1.75) is clearly invariant under 
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“mirror reflections” x; — —2; for some of the coordinates.) Corre- 
spondingly SO(JN) is also known as the group of proper rotations. We 
note here that the rotation group SO3 (without “mirror reflections” ) 
plays an important role in the study of the Schrédinger equation with 
central potentials in general and the group SO, clarifies the origin of 
the “accidental degeneracy” in the case of the Hydrogen atom (which 
we discuss in chapter 3). 

A group that is very important in the study of relativistic sys- 
tems is the Lorentz group. It is the maximal symmetry group (which 
leaves the origin fixed) of the four dimensional space-time Minkowski 
manifold (space) just as the three dimensional rotations (which leave 
the origin invariant) define the maximal symmetry group of the three 
dimensional Euclidean manifold (space) (and are important in non- 
relativistic phenomena). In the case of the four dimensional space- 
time manifold, if we define a coordinate four vector as (t denotes the 
time coordinate, x the three dimensional spatial coordinate vector 
and c represents the speed of light) 


°, x) 


Ces 6 
= (2°, 2, 2, 2°) 
= (ct,z',27,2°), pw =0,1,2,3, (1.79) 


then the length invariant under a Lorentz transformation (which in- 
cludes three dimensional rotations as well as Lorentz boosts), is given 
by 


CPx =P — PH?’ — ey, (1.80) 


and is known as the invariant length of the Minkowski space. Denot- 
ing the coordinate four vector by a four component column matrix 


(see (1.74)) 


x x 
ae =e (1.81) 


and introducing a 4 x 4 constant diagonal matrix (metric) 


0 

-1 0 0O 
1.82 

0 ’ (1.82) 


3 
| 
ooorF 
j=) 
| 
ra 


0 O -1 
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we note that we can write the invariant length (1.80) as 


a ne = Ct? — (21)? = (2)? — (23). (1.83) 


Under a Lorentz transformation the coordinates transform as 


0 10 0 
v1 
12 =A 


13 


8 8& 8&8 


1 
3 


where A is a 4 x 4 matrix implementing the Lorentz transformation. 
Since the coordinates are real, it follows that the matrix A represent- 
ing the Lorentz transformation is also real, namely, 


A= A. (1.85) 


Furthermore, the invariance of the length (1.80) (or (1.83)) now leads 
to (the constant matrix 7 does not transform under a Lorentz trans- 
formation) 


a! na! = 27 A nAx = x" ne, (1.86) 
which is possible only if the matrix A denoting a Lorentz transfor- 
mation satisfies the relation 


A’ nA =n. (1.87) 


Relation (1.87) is quite similar to (1.67) (see also (1.78)) if we replace 
the 4 x 4 constant diagonal matrix 7 by E4(= 14). Therefore, we sus- 
pect that the Lorentz transformations are related to the orthogonal 
matrices in some manner. 

To understand this better, we note that the diagonal elements 
of the relation (1.87) lead explicitly to 


3 
(ey,)" => Gia? Se pS 0, 1,2, 3. (1.88) 


i=1 


Comparing with (1.71) we see that the matrix elements of a Lorentz 
transformation satisfy the same relation as those for the orthogonal 
matrices except for the relative negative signs in the last three terms 
(which basically arise from the structure of the metric 7). Corre- 
spondingly, the Lorentz transformations are said to belong to the 
orthogonal group O(3,1) which is different from the group O(4) in 
that the metric in the case of the former has three negative and one 
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positive signature (as opposed to the case of O(4) where the metric 
is the identity matrix). 
We point out here that it is possible to identify 


a* = —ia® = -ict, (1.89) 
and introduce a coordinate four vector as 
a = (x, x*) 
= (x, x2, 23, 24) 
= (ele a, ich); = 1;2,3,4 (1.90) 


and denote it as a four component column matrix as 


z= @) = : (1.91) 


—ict 


In this case the invariant length would have the standard form with- 
out (the metric) 7 (see (1.75)) 


—#7# = —(x? + (#4?) = —((a')? + (@)P + (@P - PP) 


=? —(2') —@’) = (2?) (1.92) 


If we now define a Lorentz transformation through a 4 x 4 matrix A 
as 


ql gli Gl 
2 ~12 ~2 

~ z zB) x ~1|z ~ 

Z=] 2] 97 =] - SA fie: |S Ax 1.93 
ze ql 3 ze ’ ( ) 
a ql 4 A 


then the invariance of the length (1.92) under the Lorentz transfor- 
mation (1.93) would require 


—§'T 3! = —RAT At = —@7 3, (1.94) 
which is possible only if 
ATA = E4. (1.95) 


This is precisely the orthogonality condition in (1.67) and this would 
suggest that the Lorentz transformations belong to the orthogonal 


1.2 EXAMPLES OF COMMONLY USED GROUPS IN PHYSICS 19 


group O(4). However, as we have already seen, this is not true and 
the difference arises from the fact that because the coordinates in 
(1.90) or (1.91) are now complex (and the Lorentz transformation 
preserves the nature of the coordinates), the transformation matrix 
A is no longer real (see (1.66)), 


AAA. (1.96) 


In fact, from the nature of the coordinates in (1.91), it follows that 
the matrix elements of the transformation matrix would have the 
structure 


Poke Nn i ed Pa ake = aa, ~ ae 
ai; — ais, Ag4 — a44, a4; — —adgi, Qi = —aiA4, (1.97) 


where i,j = 1,2,3. Asa result, the diagonal elements of the orthog- 
onality relation in (1.87) in this case lead to 


4 


yaa a (1.98) 


v=1 


for any fixed value of w = 1,2,3,4. For example, for uw = 4, relation 
(1.98), together with (1.97), leads to 


3 
So (Gia)? + (G44)” = |@aa|? — |@ia|? = 1, (1.99) 
=I 


which brings out the O(3,1) nature of the Lorentz transformations 
discussed in (1.88). Note that (1.99) implies that in this case we can 
have |d44|,|@:4| > 1 in contrast to the U(N) case (see (1.51)). 

Let us conclude this section by simply noting that when we 
combine space-time translations with the Lorentz transformations, 
we obtain the Poincaré transformations and they form an interest- 
ing group. This is an important symmetry group in the study of 
relativistic systems. In four dimensions, for example, the Poincaré 
transformations act on the space-time coordinates as 


go 7/0 0 a? 

1 ial 1 1 

x : x x a 
c= oa = =A + 1.100 
pe gle ae Pee ( ) 

ge a Pe) a3 


where the 4 x 4 matrix A defines a Lorentz transformation (discussed 
in (1.84)) and a4 = (a°,a',a?,a®) = (a“)* denotes the four vector 
representing space-time translations. We will go into more details of 
these two important groups in physics in a later chapter. 
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1.2.6 Symplectic group Sp(2N). Let us introduce the 2N x 2N matrix 
€ of the form 


0 1 0 0.. : .202 4G 0 0 
(Gra) 00 00 0 0 
0 0 0 1\ 0 0 0 0 
_{| 0 0 ia 0 0 
eS | 2000 0 0 Or i0% |" 

0 0 0 0 0 0 
0 0 00 00 Oy ti 

0 0 00 ae 


(1.101) 


which is a block diagonal matrix with the anti-symmetric 2 x 2 matri- 
ces 109 = Cy 6) along the diagonal (a2 represents the second Pauli 


matrix). By construction, this matrix is anti-symmetric, 


ny =—@Nn, (eon)? = —Enn, (1.102) 


where F2xy = lon denotes the 2N x 2N identity (unit) matrix. Such 
a matrix is known as a symplectic structure (also known as a complex 
structure) and is by definition even dimensional. Let us next consider 
a 2N x 2N matrix A which preserves the symplectic structure €2n, 
namely, 


A’ en A= enn = AeqyA’. (1.103) 


(The relation (1.103) can be compared with (1.67) with egy > Eon = 
lox.) Such a matrix is known as a symplectic matrix in 2N dimen- 
sions and the set of all 2N x 2N matrices satisfying (1.103) can be 
shown to define a group known as the symplectic group in 2N dimen- 
sions denoted by Sp(2N) (or Spon). Any symplectic matrix satisfying 
(1.103) is known to have unit determinant (det A = 1) so that it is 
automatically special. We do not go into the general proof of this 
here since it is rather involved. 

The symplectic groups are of some relevance in the study of 
quantum mechanics. For example, it is straightforward to show that 
the real symplectic group Sp(2N,R) (which consists of symplectic 
matrices which are real) is the invariance group of the N-dimensional 
Heisenberg algebra. Let us recall that if gq; and p; with i = 1,2,---,N 
denote the canonical coordinate and momentum (Hermitian) opera- 
tors, they satisfy the canonical (Heisenberg) commutation relations 


lai, qj] =90 = [pi,p;], [a, pj] = 1d,;, (1.104) 
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for 1,7 = 1,2,---,N. This is commonly known as the Heisenberg 
algebra. Let us next introduce a 2N-dimensional vector operator 
Eu, H=1,2,--- ,2N with the components 


Eu = (M1, P1, 92, P2,°** ,9N, PN). (1.105) 


In terms of €,,, the Heisenberg algebra (1.104) can be written com- 
pactly as 


[Eu &v] i thE ww, boreal 2 ,2N, (1.106) 


where €,,, corresponds to the (u,v) element of the matrix € defined 
n (1.101). 
Let us now make a transformation of the vector €,, as 


2N 


bu > a = S- Buv&v, (1.107) 


v=1 


where {,,, are constant parameters. It follows now that 


2N 
kare l= y Burd, : fia = S° BurBue [E), Ee 


A,o=1 


2N 


= ih Y7 BurBvoere- (1.108) 


A,o=1 


This implies that the transformed variables aA would also satisfy the 
Heisenberg algebra 


[6 &] = thew, (1.109) 


if the transformation parameters 6, satisfy 


2N 2N 
S> BurBve®ro = Se Biuxtie bie =< inl: (1.110) 
A,o=1 A,o=l 


Introducing the 2N x 2N matrix 


en es = ee hs (1.111) 
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we note that we can write (1.110) in the matrix form as (compare 
with (1.103)) 


AeA? =«. (1.112) 


Namely, we see that the real symplectic group of transformations 
S'p(2n, IR) defines the invariance group of the N-dimensional Heisen- 
berg algebra (the reality condition arises from the fact that the op- 
erators are Hermitian). Unfortunately, however, the Hamiltonian, 
say for the harmonic oscillator system, is not invariant under the 
Sp(2n, R) group of transformations defined by (1.107) and (1.112). 
More explicitly, 


1 1 
H=5) (i +@) #5) (Hi)? +@), (1.113) 
i=1 i=l 
where 


so that it is not a symmetry of the harmonic oscillator system. 

The special case of the symplectic group for N = 1 is of some 
interest because it arises in various branches of physics. In this case, 
we can identify Sp(2) = SL(2) as follows. Let 


Se @ 5) . (1.115) 


denote a 2 x 2 symplectic matrix. Therefore, by definition (see 
(1.103)) it satisfies 


Aven A = €9, 
= GAG I)G )-G 9: 
ot ere By) as F (f, ) 


0 1 0 1 
or, detA i 5) = & . 
or, det Ae =e, (1.116) 


where we have used the definition of the determinant, 


det A = ad — By. Cit) 
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Equation (1.116) determines that 
det A = ad — By = 1, (1.118) 


for a mtrix to belong to Sp(2). However, this is also the condition for 
a matrix to belong to SL(2) and this proves the equivalence (isomor- 
phism) between the two dimensional groups Sp(2) and SL(2). This 
fact can be used to construct 2-component spinors in physics as we 
will show in a later chapter. 

These two dimensional groups are also quite relevant in the 
study of Bogoliubov transformations (among other things). For ex- 
ample, let a and a! denote the annihilation and creation operators 
for a harmonic oscillator system satisfying (see, for example, (1.57)) 


[a,a]=0=[al,a"), [a,a'] =1. (1.119) 
Let us consider a transformation of the operators of the form 
a>b=aa+ Bal, 
al +b! = B*tataral, (1.120) 


where we assume that the constant parameters of the transformation 
a, 8 satisfy 


aa* — BB* = 1. (1.121) 


In this case, it is straightforward to check that the transformed op- 
erators satisfy the algebra 


[o,b] =O =[b',b'],  [b,o'] = 1. (1.122) 
For example, we note that 
[b, b'] = [aa + Bal, Bra + ata] 
= aa*|a, a'] + 66*[a', a] = aa* — BB* = 1. (1.123) 


We note that if we define the matrices 


(SQ AG Bom 


then we can write the transformation (1.120) in the matrix form as 


@>b= Aa, AeAT = ATA = det Ae. (1.125) 
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Here we have used the identification in (1.116). The commutation 
relations in (1.119) can now be written compactly as 


Calculating the commutation relations for the transformed operators 
in (1.125) (see (1.108)), it follows now that 


[b:, 6;| = (Ae2A™),. = det A (€2) 


2 - (1.127) 


for 7,7 = 1,2. The invariance condition for the commutation rela- 
tions, (1.121), is now seen to correspond to 


det A = 1. (1.128) 


A transformation mixing the annihilation and creation operators is 
an example of a Bogoliubov transformation and we see that it is 
implemented through a matrix A € SL(2). (This is useful in the case 
of harmonic oscillator, for example, in the construction of coherent 
states.) We note here that the matrix A is not unitary, namely, 
AtAF Ep. 

To conclude this discussion, we note that the groups SL(N),U(N), 
O(N), SO(N), Sp(2N) are termed classical groups and discussed in 
detail by Hermann Weyl] in his famous book Classical Groups. The 
book also includes a discussion of the symmetric group Sj as well as 
a more general linear group GL(N). Without going into details, we 
simply note here that the general linear group GL(N) consists of all 
N x N matrices A with only the restriction det A 4 0 (nonsingular) 
so that the inverse A~! always exists. 


1.3 Group manifold 


In this chapter, we have discussed a finite group (with a finite number 
of elements) as well as several (Lie) groups which are quite impor- 
tant in physics. The (Lie) groups, as we have seen depend on some 
continuous parameters and correspondingly they are known as con- 
tinuous groups which have an (nondenumerably) infinite number of 
elements. Let us introduce the concept of a group manifold for such 
groups which would help us in classifying infinite dimensional contin- 
uous groups (examples of which we have studied in the last section) 
into compact and non-compact groups. Basically, the parameters 
on which elements of a continuous group depend define a topologi- 
cal space known as the parameter space. Since the group elements 
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depend on these parameters, every element can be put in one-to- 
one correspondence with points on the parameters. In this way, a 
continuous group inherits the topology of the parameter space and 
correspondingly is also known as a topological group. 

We will illustrate these ideas through examples. Let us consider 
the one dimensional translation group 7; discussed in section 1.2.2. 
In this case an element of the group T(a) represents a translation 
of the coordinate (which can be a space or a time coordinate) by a 
real continuous parameter “a” with —oo < a < oo. Therefore, the 
parameter space, in this case, corresponds to the real line and it is 
clear that we can represent every group element of TJ; diagrammati- 
cally as a point on the real line. For example, a specific element T(a) 
would have the coordinate “a” on this real line from the origin which 
can be identified with the identity element e = T(0) (see (1.37)) as 
shown in Fig. 1.2. Topologically the real line can be identified with 
a Cartesian axes, say for example, the x-axis. Therefore, this shows 
that there is a one-to-one correspondence between the elements of 
the group 7; and points on the z-axis and we can identify the group 
manifold of T; with the whole (entire) real line. 


Figure 1.2: Diagrammatic representation of one dimensional transla- 
tions on a real line. 


Let us next consider the example of the one dimensional unitary 
group Uj discussed in section 1.2.3. A typical group element U(@) = 
e'°,0 < @ < 2m corresponds to a simple (one dimensional) phase 
rotation by an angle 6 (see, for example, (1.41) and (1.42)). Asa 
result, the parameter space, in such a case, can be identified with the 
unit circle (labelled only by an angle) and every element U(@) of the 
group U; can be assigned a unique angle @ on the unit circle (angle 
which a vector from the origin to the unit circle makes with the z- 
axis) as in Fig. 1.3. Namely, there is a one-to-one correspondence 
between the elements of the group U; and points on the unit circle 
and we can identify the group manifold of U; with the unit circle. It is 
worth noting here that since U(@ + 27n) = U(@) for any integer N, if 
the parameter @ were allowed to take any arbitrary value (without the 
restriction 0 < @ < 27), there can no longer be a simple one-to-one 
correspondence between the group elements and the unit circle. In 
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this case, the parameter space coincides with that of the real line and 
the topological space known as the covering group of U(1) becomes 
equivalent to the group manifold of T). 


Figure 1.3: Diagrammatic representation of the one dimensional uni- 
tary group on a unit circle. 


We now turn to the more complicated case of the group Uy 
discussed in section 1.2.4. Here the group elements are realized as 
N x N unitary matrices A whose elements a;;,7,7 = 1,2,--- ,N are 
complex parameters. In general, a matrix with N? complex param- 
eters (elements) can be equivalently thought of as described by 2N? 
independent real parameters. However, the unitarity condition (1.48) 


AAl = Ey = A'A, (1.129) 


imposes NV? real conditions on the elements of a N x N unitary matrix 
(note that the identity (unit) matrix is real which is the reason for 
the N? real conditions). As a result, the matrix A can be thought 
of as being parameterized by N? independent real parameters which 
we can take to be, for example, Reaj;,7,7 = 1,2,--- , N. Therefore, 
it would appear that the parameter space can be identified with a 
Euclidean manifold with N? independent axes. However, we note 
from (1.51) that the magnitudes of the elements a;; are bounded by 
unity so that all the points in this space must satisfy 


[Re ay;| <1, (1.130) 


and the space corresponding to the unit ball in N? dimensions is 
bounded just like the unit circle (in the case of U(1)). Since every 
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element of U(.N) can be mapped to a point in this unit ball, in this 
case the group manifold can be identified with the unit ball in N? 
dimensions. The group manifolds for more complicated continuous 
groups can similarly be constructed and are much more complicated 
in general. Nonetheless we can always assign a correspondence be- 
tween the elements of a given group and points on a topological space 
(defined by the parameters) which is known as the group manifold 
for the particular group. 

A topological space with a given topology (see, for example, 
the book by J. L. Kelley) is called compact if it has a bounded and 
closed topology. The space is known as non-compact otherwise. The 
continuous groups are correspondingly classified as compact or non- 
compact depending on the nature of the group manifold (topological 
space) associated with them. It is clear from this that the group of 
translations TJ) is non-compact since the real line associated with this 
group is not bounded. On the other hand, the unitary group U(N) 
is compact since the group manifold is bounded. In a similar man- 
ner, it can be shown that the groups SU(N), SO(N) and O(N) are 
also compact. However, it is worth noting that orthogonal groups 
of the form O(m,n) with nontrivial m,n are non-compact. For ex- 
ample, in the case of the group O(3, 1) we note from (1.88) that the 
parameters of the transformation (matrix elements a,,) do not have 
to be bounded in order to satisfy (1.88) (because it involves a differ- 
ence). As a consequence, Lorentz transformations are non-compact 
(the boosts are not bounded). Similarly, the Poincaré group is also 
non-compact since the space-time translations are not bounded (in 
addition to the fact that the Lorentz transformations are, in general, 
not bounded). We note here that the group SLy is also not compact. 
This can be easily seen by restricting to n = 2 where we can label 
the matrix as (see (1.115)) 


_(% B 
A= (° 4 (1.131) 
satisfying (see also (1.117) and (1.118)) 

det A= ad — By = 1. (1.132) 


However, without the unitarity condition (1.48) (for example, in the 
case of SU(2)) or the orthogonality condition (1.67), the matrix ele- 
ments a, 3,7, 0 satisfying only (1.132) are no longer bounded (namely, 
they can be arbitrarily large and still satisfy (1.132)). As a result, 
the group manifold (and, therefore, the group) is non-compact. We 
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note that since the groups SL(2) and Sp(2) are equivalent, S'p(2) is 
non-compact as well (so is Sp(2N)). Finally, we note that any finite 
group can be thought of as compact since it is bounded and closed 
(in the sense of a finite set). 
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CHAPTER 2 


Representation of groups 


While the structure of the abstract group is quite interesting, in 
physics the representation of the group itself is more important. The 
most common representations of (Lie) groups are in terms of finite di- 
mensional nxn matrices, although sometimes the infinite dimensional 
(unitary) representations may be more meaningful for non-compact 
groups. In this chapter we will discuss the matrix representations 
as well as representations of groups by linear operators in a vector 
space. 


2.1 Matrix representation of a group 


The nxn matrix representation of a group G is defined as follows. For 
every element of the group a € G, if we can construct an invertible 
n Xn matrix U(a) such that 


(i) we can identify 
U6) = Eas (2.1) 
where E,, = 1, denotes the n x n identity (unit) matrix, 


(it) and for any two elements a,b € G, the matrices U(a) and U(b) 
satisfy the relation 


U(a)U(b) = U(ab), (2.2) 


then, the set of all such n x n matrices {U(a)} forms a group and 
provides a (n x n matrix) representation of the group G. We note 
that in (2.2) the product on the left hand side is the natural associa- 
tive matrix product which defines the product rule for such a matrix 
group. 

Indeed it is straightforward to check that such a set of matrices 
forms a group in the following manner. 
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(i) 


(ii) 


(iit) 


(iv) 


We note from the defining relation (2.2) that 
U(a)U(b) = U(ab) = U(c), (2.3) 


is well defined with the matrix product rule and is in the set of 
matrices {U(a)}. Here we have used the group property (1.1), 
namely, for any two elements a,b € G of the group, the group 
multiplication leads to ab = cE G. 


Since the matrix product is known to be associative, it follows 
that for any three matrices in the set we have 


(U(a)U(b))U(c) = U(a)(U(b)U(o)), (2.4) 


which should be compared with (1.2). In fact, given the group 
associative property (1.2) we note that using (2.2) we can write 


(U(a)U(b))U (©) = U(ab)U(e) = U((ab)c), 
U(a)(U(b)U(c)) = U(a)U (bc) = U(a(be)), (2.5) 


so that we have 
(U(a)U(b))U(c) = U(a)(U(O)U(e)) = U (abe), (2.6) 
since (ab)c = a(bc) = abc (see (1.2)). 


By definition (2.1), the set has a unique identity (unit) element 
U(e) = E, which, using the matrix product, satisfies 


U(e)U(a) = E,U(a) = U(a) = U(a) En = U(a)U(e), (2.7) 
which should be compared with (1.3). The relation (2.7) can 
also be viewed from the point of view of group multiplication 
as 

U(e)U(a) = U(ea) = U(a) = U(ae) = U(a)U(e), (2.8) 
corresponding to ea = a = ae. 


Since by definition 


=U(a“)U(a), (2.9) 


2.1 MATRIX REPRESENTATION OF A GROUP 


31 


it follows that 


det U(a) det U(a~') = det (U(a)U(a™")) 


= det U(e) = det FE, = 1. 


This shows that 
det U(a) 4 0, 
so that (U(a))~! exists and satisfies 
U(a)(U(a))~* = En = (U(a)) *U (a). 
Comparing (2.9) and (2.12) we conclude that 


(U(a))~* = U(a™). 


(2.10) 


(2.11) 


(2.12) 


(2.13) 


Conversely, we can multiply (2.12) with U(a~') either from left 
or from right and use the group composition to show that the 


identification in (2.13) follows. 


This shows that the set of n x n matrices constructed satisfying (2.1) 
and (2.2) forms a group and provides a matrix representation for the 


group G. 


We note here that if the set of matrices {U(a)} defines an x n 
matrix representation of a group G, then the set {S~'U(a)S}, where 
S denotes any arbitrary n x n invertible matrix, also provides an xn 
matrix representation of the group G. This is easily seen by noting 


that if we identify 
W(a) = S-'U(a)S, 
then, using the matrix product 
(i) we have 


W(e)= S~!U(e)S = Gb gas = Fe, 


(it) and, furthermore, 


(2.14) 


(2.15) 


W(a)W(b) = S-'U(a)SS~1U(b)S = SU (a) E,U(b) 8 


= S~'U(a)U(b)S = S~'U(ab)S = W(ab), 


(2.16) 
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which can be compared with (2.1) and (2.2) respectively. Follow- 
ing the same arguments as before, we conclude that the set {W(a) = 
S~!U(a)S} also defines a group and provides an xn matrix represen- 
tation of the group G. This suggests that the matrix representation 
of a group G may not be unique since we can construct infinitely 
many sets of matrices similarly related to the set {U(a)} for different 
invertible matrices S. However, we regard all such sets of matrices, 
related by a similarity transformation, to be equivalent and do not 
consider {W(a)} to provide a distinct representation of G. This can 
be understood in the following manner. 

In the last chapter, we have pointed out that abstract groups can 
be realized as groups of transformations acting on appropriate linear 
vector spaces. For example, we have seen that the SO(N) group 
can be associated with the group of rotations in a N-dimensional 
coordinate space while the U(N) group rotates the creation and an- 
nihilation operators of N harmonic oscillators. Transformations in 
a linear vector space may be implemented through linear operators 
acting on vectors. (Only for a very unique discrete transformation 
is the corresponding operator anti-linear.) Therefore, given a group 
G, we may identify every element of the group (a € G) with a linear 
operator A in a finite dimensional vector space Vp, 


a— A. (2.17) 


Let us consider an orthonormal basis {x;},7 = 1,2,---,n in Vp. 
A linear operator in V,, takes one vector in the vector space into 
another which can be expanded in the basis |x;) (the Dirac notation 
for a state vector). In particular, we note that 


n 
Ae aye, a1 2 ee (2.18) 

j=l 
The n? coefficients of expansion a;; describing the action of the linear 
operator A on the n basis vectors x;, 1 = 1,2,--- ,n can be thought 
of as the components of an xn matrix giving a matrix representation 
U for the linear operator A. In this way, we can construct an x n 
matrix representation for the group G through linear operators acting 

on V,, namely, 


a> A—U(a), (2.19) 
with 


(U(a))ij = ay, (2.20) 
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which can also be written explicitly as 


Q41 12 °°: Ain 
Q21 22 °°: Aan 

uay=f (2.21) 
Qnl An2 *** Onn 


As a result, the relation (2.18) can also be written as 


n 


Ax; = S°U(a)) ji. (2:22) 


j=l 


We can verify that these matrices indeed satisfy the group prop- 
erties in the following manner. 


(i) We note that if we identify (where 1 denotes the identity oper- 
ator) 


eI, (2.23) 


then it follows that since the identity operator leaves every vec- 
tor invariant 


la, =27;, 1=1,2,---,n, (2.24) 
and we can identify 


(U(e))ig = diy, 


LG ee 0 
G. ay vox 3G 

or, Ule)=] | 2 ces (2.25) 
0 1 


(ii) Furthermore, for any two other elements of the group 
os BL. CSG, (2.26) 
where B and C are linear operators acting on Vy, if we have 


1 me a (2.27) 


34 


2 REPRESENTATION OF GROUPS 


then we can write 
AB=C. (2.28) 


On the other hand, following (2.18)-(2.22), we note that we can 
write 


Bay = D7 Byes = DU ()) 03, 
j=l j=l 

Cas = >) 823 = DU (0) seas, (2.29) 
j=l j=l 


where we have identified 


(U(b))i5 = Biss 


Bu Pre Bin 
B21 Boa Bon 
or, U(b) = 7 | 
Bni Bn2 Ban 
and 


Yll sY12 «°° Yin 
YO YOR YOR 

on. Ue =~ nae ie (2.30) 
Ynl Yn2 °*°*  Ynn 


It follows now that (by definition, a linear operator acts only 
on vectors and not constants) 


=) _ (U(aQ)U(b) nite. (2.31) 
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Comparing (2.31) with (2.28) and (2.29), we conclude that 
(U(a)U (b))ig = (U (ab) )ig = (UO) as. (2.32) 


It follows now from (2.1) and (2.2) that the set of matrices 
{U(a)} do satisfy the group properties and provide a matrix 
representation of the group. 


However, we note that the matrix elements of a linear operator 
are basis dependent. For example, as we have seen in (2.20) and 
(2.22), in the basis x;, i = 1,2,--- ,m we can write 


n 


At =. (UG) pap. (2.33) 


j=l 


On the other hand, if we have another basis |y;), 7 = 1,2,---,n in 
Vn, related to |x;) through a similarity transformation S (det S 4 0 
by definition of a similarity transformation, namely, it is invertible), 
we can write 


y= >) Spey (2.34) 
It follows now that 


Ay; = AL Sy = Sides 


j=l 
= Ss" Syi(U(@) )ajtk = Ss" Syi(U(a))nj(S-*)exye 
jk=1 jk l= 
=) (STU (a)S) eye, (2.35) 


where we have used (2.22) as well as (2.34). The coefficients of ex- 
pansions in the new basis |y;) are given by the elements of the matrix 
S—!U(a)S which would also provide a matrix representation of the 
group, namely, W(a) = S~'U(a)S (see (2.14)). Thus, we see that 
the same linear operator can lead to different matrix representations 
of the group depending on the choice of basis. Since the choice of a 
basis does not alter the physical contents of operators, all such ma- 
trix representations are considered to be equivalent to U(a) and not 
distinct from U(a) unless stated explicitly. 
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As an aside we note that in (2.34) we have represented the sim- 
ilarity transformation as 


n 
yi = Ss" Syitj, (2.36) 
j=l 
instead of the conventional notation 
ye = > Spey (2.37) 


The two relations (2.36) and (2.37) are, of course, physically equiv- 
alent and can be understood as associated with the active or the 
passive description of transformations. For example, let us consider 
a given vector V in the vector space which can be expanded in the 
basis x; as 


i=1 


Under a transformation 
VoV'=SV, (2.39) 


the change in the vector can be viewed in one of the two following 
equivalent ways. We may expand the transformed vector in the same 
basis as 


n 
A ee Ss U,Li, (2.40) 
i=1 


which can be thought of as implying (see (2.39)) that the basis (of 
expansion) is unchanged under the transformation, rather the coef- 
ficients of expansion change. In other words, the vector transforms, 
but not the basis (of expansion). This is known as the active picture 
of transformation. In this case, using the linear nature of operators 
and (2.18) we determine from (2.38)-(2.39) and (2.40) that 


n 


y' =SV= uch = Ss" UjS Lj 


i=1 ij=l 


n n 
= So (Sv) ;2; = Souja, 
gat i=l 


or, v,= (Sv); = Se Siz; (2.41) 
j=l 
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so that we can write 


n n 
as =— Souja = S SyjVj Tj. (2.42) 
i=1 


t,j=1 


In particular, in this picture, if we consider the vector V = a;, for 
which the expansion (2.38) takes the form (6;; represents the Kro- 
necker delta) 


n 
V= Ly = Ss" OijLj, (2.43) 
j=l 
then it follows from (2.42) that under the transformation 
n 
at, =y => Sig03, (2.44) 
j=l 


which coincides with (2.37) and shows that this form is associated 
with the active view of the transformation. 

Alternatively, we can view the transformation in (2.39) as chang- 
ing the basis of expansion leaving the coefficients of expansion un- 
changed. This would then correspond to leaving the vector unchanged 
while changing the basis and is known as the passive view of the co- 
ordinate transformation. In this case, we can write 


n 
YS S- UiL;,. (2.45) 
i=l 
Using (2.42), we conclude that in this case we have 


which coincides with (2.36) and shows that this form is associated 
with the passive view of the transformation. In our discussion we have 
been using a passive transformation and (2.36) is correspondingly the 
appropriate relation. 


> Example (Translations in one dimension). Let us end the discussion of this sec- 
tion with an example of the representation of the translation group in one dimen- 
sion JT; through matrices as well as through linear operators. Let us recall that 
the group of translations in one dimension is defined by (see (1.26)) 


Okra (2.47) 
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where a is the real constant parameter of translation. Let us now introduce a 
basis in the n-dimensional vector space of coordinates through 


Om(x) = 2, m= 0,1,2,--- »n,; (2.48) 
and note that the basis functions naturally satisfy 
dpm 
Plt) = Mm—1(2). (2.49) 


We can arrange the basis functions in the (n-dimensional) column matrix (vector) 
form 


vo(x) 
y1(z) 
U,(2) = ; : (2.50) 
Un—1(2) 
so that the action of translation on this column matrix can be written as 
V(x) 2s Wale +a). (2.51) 


With this notation, we can now obtain the matrix representations of various 
dimensions for the translation group. For example, for n = 2 we obtain the two 
dimensional matrix representation 


we = (28) = @) 7 tS) - Gi) - 6 )O) 
= CG :) ot =U;(a)Vo(e), (2.52) 
qyhbie wechavelidentified 


iay= (’ :) iets, pi ¢ a (2.53) 


Using (2.49) we note that the matrix D2 acts like the derivative on the two di- 
mensional matrices, namely, 


nasne() S)(S)=(fy)=¥on as 


where the prime denotes a derivative with respect to x. Thus, (2.53) indeed 
coincides with (1.31). Furthermore, it is easily checked that the matrix U2(a) 
satisfies the group property 


U2(a)U2(b) = U2(a +b), (2.55) 
which can be compared with (1.34). 


For n = 3 we can derive the three dimensional matrix representation for the 
translation group as 


Yo(x) i Yo(x + a) 1 
W(x) = | di(x) | = | x ayy wi(cta)|] =| xt+a 
2(x) a? 2(a + a) (a +a)? 
1 0 O\/1 1 0 O\ /to(x) 
= 1 Off e@}=fta 1. Off dil) 
a’? 2a 1/)\a? a” 2a 1) \we(x) 


II 
S 
a~ 
a 
= 
i=) 
wo 
~~ 
8 
> 


(2.56) 
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where we have identified 


1 O 
U;(a)= [a 1 
a? 2a 


with 


0 0 O 
Ds={1 0 O 
0 2 0 


0 
0 
a 


2 


a 
=13;+aD3+ oi (Ds)’, 


Once again, using (2.49) it can be easily checked that 


D3V3(x) = Ww (x), 


and that the matrices U3(a) satisfy the group property 


Us(a)U3(b) = Us(a + b). 


(2.57) 


(2.58) 


(2.59) 


(2.60) 


In this way, one can obtain the n-dimensional representation for the (one dimen- 


sional) translation group to be 


Un (x) T(a) 


>W,(x4 


where we have 


1 0 0 0 
a 1 0 0 
2 
Un(a)=|% 22 1 0 
(2) a? 3a2 3a—s1 
2 n-1 
‘<- a 2 a 
=In +aPn + 57 (Pn) (n— 1) 
with 
0 
1 0 
Pa 0 2 =O 
0 0 8 0 


denoting the derivative in the n-dimensional space, namely, 


DrW n(x) = Vi, (2). 


Furthermore, it can be checked that 


Un(a)Un(b) = Un(a + 6). 


(2.61) 


(2.62) 


(2.63) 


(2.64) 


(2.65) 


It is clear from the structures of the matrices in (2.53), (2.57) and (2.62) that these 
finite dimensional representations of the translation group are not unitary. Fur- 
thermore, taking the limit n — oo, this leads to an infinite dimensional (reducible) 
representation of the translation group. (We discuss reducible representations in 


the next section.) 
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The effect of a change of basis can also be studied in this example in a 
straightforward manner. For example, let us consider the two dimensional repre- 
sentation discussed in (2.52)-(2.55). We note that if we change the basis of two 
dimensional vectors from (2.52) to 


== (86) = (122) 


= (ey tee) (1 1) (W) ously, (266) 
where 


SS ( a 2 oe ¢ *) (2.67) 


Under a translation, such a two dimensional vector transforms as 


=tel= (Gfo)) = G2) (2271) 
=(° e (2s) G20) 


= W,(a) ee (2.68) 


where we have identified 
Wwe ‘ +7 3 :) (2.69) 
sig? ‘ale G 
It can be checked in a straightforward manner that 
W2(a)W2 (b) = Ww, (a + b), (2.70) 


so that W2(a) also provides a two dimensional representation of the translation 
group. From (2.53), (2.67) and (2.69), we also recognize that we can write 


W.(a) = S~'U2(a)S, (2.71) 


which shows that the two representations are similar and (2.71) can be compared 
with (2.14). 

As we have mentioned, we can also construct a representation of the trans- 
lation group through linear operators. For example, let us identify 


d 
T D(a) = 2.72 
(a) + Dla) =e (at), (2.72) 
and note that the linear operator D(a) satisfies 


DED) exp (a2) a (2) es ((c + =) = D(a+b). (2.73) 


Therefore, the operators D(a) satisfy the group properties of translations (see 
(1.34)). Furthermore, acting on any differentiable function f(x), they lead to 


co 


Diafe)=en(ag.) f= (ag) M@=se+e, 278 
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so that they truly translate the coordinate of the function and, therefore, provide 
a representation of the translation group. 

We note that we can generalize the construction (2.72) to higher dimensional 
translation group Tw as follows. Let x = (a1, %2,--- ,an) be the coordinates of a 
N-dimensional coordinate space and consider the translations 


T 
$6 Oy se Lae or, %Jata, t=1,2,---,N. (2.75) 
where we have identified a = (ai1,a2,--- ,aw). In this case, we can consider the 


linear operator 


D(a) = exp (a: V) = exp we og] (2.76) 


which can be easily shown to satisfy 
D(a)D(b) = D(a+b), (2.77) 


which coincides with the group composition of the translation group Ty. As in 
the one dimensional translation group, we can also construct various (matrix) 
representations of Ty by restricting to appropriate functional spaces. 

Finally, let us complete the discussion of this example with a nontrivial 
one dimensional representation of the translation group 7T;. Let us consider an 
exponential function of the form 


Jo(%) =exp(ax), a0. (2.78) 
Such a function would transform under a translation as 


Jo(x) ua exp(a(xz + a)) = exp(aa) exp(az) 


= exp(aa) ga(x) = Ua(a)ga(x), (2.79) 
where we have identified 
Ua(a) = exp(aa). (2.80) 
We note that 
Ua(a)Ua(b) = exp(aa) exp(ab) = exp(a(a+ b)) = Ua(a + d), (2.81) 


so that U.(a) provides a one dimensional (1 x 1 matrix) irreducible representa- 
tion of the translation group. (We will discuss the concept of reducibility and 
irreducibility in the next section.) We simply note here that Ua(a) and Ug(a), for 
a # 8, provide inequivalent representations. 

< 


2.2 Unitary and irreducible representations 


In physics unitary transformations are quite important. Let U(a) 
denote a matrix representation of a group G of transformations. The 
matrix U(a) may or may not be unitary. (We have already seen in 
the example of the representations of the one dimensional translation, 
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discussed at the end of the last section, that finite dimensional matrix 
representations are not unitary.) If the matrices are unitary, then 
the representation of the group is known as a unitary representation. 
Even when U(a) is not a unitary matrix, if we can find a similarity 
transformation S' leading to an equivalent representation 


W(a) = S-'U(a)S, (2.82) 
which satisfies 
(W(a))'W(a)=E, forall aéG, (2.83) 


the representation is known as a unitary representation. 
Let us give a simple 2 x 2 matrix example to illustrate this. Let 
us consider the matrix 


1 fl-i 0 
mee). a9 


Clearly, this is not unitary since 


rep otl i=7 0 tte =i 
i a eer ga 0: T=9 


ao SE igh ie 2.85 


and, similarly 


Peet WSs 
UlU = — Bs. 2.86 
Ce. Ae (2.86) 


On the other hand, let us consider a nonsingular matrix 


i a ; (2.87) 
ar >. 


It is a simple calculation to see now that 


geet jaseteey as Pirie GH 1 -1 
7 AERO. eo TEE NO OF 


1 fl 2 
oi, a 
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which is clearly unitary since 
WW! =W'W = Ep. (2.89) 


In a similar manner, if we can find a similarity transformation 
S such that the n x n matrix U(a) with n = n, + ng and n4,n2>1 
(providing a representation for the group G) can be brought into 
either of the forms 


0" ita) 


W (a) = S-'U(a)S = or, (2.90) 
ee 0 ) 
N(a) Wa(a)]’ 
the representation is known as reducible. Here W (a) and Wo2(a) are 
ny, Xn, and nz X nz square (sub) matrices respectively. On the other 
hand, N(a) denotes an x ng matrix while N(a) is a ng x n, matrix. 


We note that since W(a) would also provide a matrix representation 
for the group, we have 


W(ab)=W(a)W(b), a,bEG, (2.91) 
which in turn requires that the diagonal blocks satisfy 
W\ (ab) = Wi(a)W (0), W2(ab) = W2(a)W2(0). (2.92) 


On the other hand, the off-diagonal blocks in the two cases in (2.90) 
have to satisfy respectively 


N(ab) = Wi(a)N(b) + N(a)W2(b), (2.93) 
N(ab) = N(a)W,(b) + W2(a).N(b). (2.94) 


It is clear from (2.92) that both Wi(a) and Wo2(a) provide re- 
spectively n; x n1 and ng X ng matrix representations of the group 
G. In particular, it can be easily verified that 


Wi(e) = En; W2(e) = Eng, (2.95) 


where E,, and E,;,. denote (unit) identity matrices of dimensions 1 
and nz respectively. As we have mentioned earlier, when a similarity 
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transformation S can be found which brings the matrix U(a) to the 
form (2.90), the (original) representation is known as reducible. In 
particular, if N(a) = 0 (or N(a) = 0) for all a € G, then we have 


Wwiy=s ues = Gee oe) (2.96) 


If we cannot find a similarity transformation leading to the reducible 
form of (2.96), we say that the representation U(a) is irreducible. 
More generally, if we can find a similarity transformation S such that 
U(a) can be brought to a block diagonal form 


Wi (a) 
Wala 
W(a) = S-1U(a)S = a W3(a) i 


(2.97) 


where the diagonal (sub) matrices W1(a), W2(a),--- are irreducible, 
we call such a representation fully reducible. 

The origin of the name “reducible” can be understood as follows. 
We note that if we identify the linear operator A with the group 
element a € G (see (2.17), namely a > A), then following (2.19) as 
well as (2.33)-(2.35) we have the transformation of a n dimensional 
vector under the linear transformation A given by 


Ay: = S> (S71U(a = 


j=l J 


(2.98) 


n n 
| 


For simplicity, let us consider the first structure for W(a) in (2.90). 
It is clear, in this case, that if the vector y; has the special form 


Y= (Yt, Yas° °° ac OO I (2.99) 


then, under the transformation (2.98), the transformed vector will 
have the form 


nL 


S2 (Wala) j¢4j, for i= 1,2,--+,ma, 
Ayi = 4 j= 
0, for t=nyt+1,n14+2,---,n 


(2.100) 
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Namely, the vector remains form invariant under the transforma- 
tion signifying that the linear operator A has an invariant subspace. 
(Namely, the operator A does not take a vector out of this n; dimen- 
sional space.) It is easy to see that the second form in (2.90) has a 
ng dimensional invariant space of the form 


Yi = (0, 0,--- 9, Yng+1s Uni 429°°° Unde (2.101) 


When there are no invariant subspaces for a linear operator repre- 
senting a group, the representation is known as irreducible while it 
is called reducible otherwise. Using these ideas we can prove an im- 
portant result in group theory known as Schur’s lemma (named after 
Issai Schur) which is stated as follows. 


Schur’s lemma: 


Let U“ (a) and U®)(a) denote respectively two irreducible n1 x 
ny, and ng X ng matrix representations of the group G. If there exists 
any X no matrix T such that 


UY (a)T = TU)(a), (2.102) 
with each product representing a n1 X ng matrix, then 


1. If ny A ne, then the matrix T = O (corresponds to the null 
matrix). 


2. If ny = ng and U (a), U@)(a) represent two inequivalent irre- 
ducible representations of G, then also J’ = O corresponds to 
the null matrix. 


3. If ny = nz and UW (a) = U®)(a), then the matrix T must be a 
multiple of the identity matrix, namely, 


T = En; (2.103) 


for some constant A. Here E,, denotes the n1-dimensional 
(unit) identity matrix. 


We will not go into the proof of Schur’s lemma which is very 
nicely discussed in the book by L. Jansen and M. Boon. 
> Example (Neither fully reducible nor unitary representation). As a simple ex- 
ample of a representation which is not fully reducible, let us consider the one 


dimensional translation group 7; which, as we have discussed, is a non-compact 
group. As we have seen in (2.53) it has a two dimensional representation given by 


Ula) = é 7 (2.104) 
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We note that 
U2(a = 0) = Fr, 


BAO EAG G :) € i) = ee ) iad) (2.105) 


as required. However, this representation is not fully reducible. For if it were, 
this would require that there exists a (2 x 2) nonsingular matrix S' such that (see 
(2.90) and (2.96)) 


S1Ua(a )s= (4 » Eo, 


o:. Ha Sene B= (1 He (2.106) 


which is not true. 

Similarly, we can show that this cannot be a unitary representation as fol- 
lows. Let us assume that U2(a) is a unitary representation. Then, there exists a 
(2 x 2) nonsingular matrix S such that (see (2.82)-(2.83)) 


W2(a) = S~*U2(a)S, (2.107) 
defines a unitary matrix, namely, 
(W2(a))'We(a) = E2. (2.108) 


Furthermore, since any unitary matrix is diagonalizable, there exists a nonsingular 
matrix T such that 


T 'Wo(a)T a ay (2.109) 


It now follows from eqs. (2.107) and (2.109) that 


(st) 'atayst)= (4 2), 


or, (ST)~* (Us(a) — AE2) (ST) = ea . ie ) , 


or, det ((ST)~" (Uz(a) — AE2) (ST)) = det ar : ee. ) , 
Ar — 2 0 


or, det (U2(a) — AE2) = 0 ‘ 
— 


a1 — > 0 


fe Pee oO 
“Dear 


a 1-Al 


or, (=A)? = O8= AD Oo— A); 
or, Ar = A2 = 1, (2.110) 


This, in turn, implies that 


(sry *vx(ay(st) = (7) = 


et. tala) = GT mST) = Bee ( :) (2.111) 
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which is not true so that U2(a) cannot define a unitary representation. Therefore, 
U2(a) is neither fully reducible nor a unitary representation of the one dimensional 
translation group. These facts are, of course, expected since 7) is a non-compact 


group. 
< 


The fact that a non-compact group does not have in general any 
finite dimensional unitary representation is expressed by the following 
theorem. 


Theorem. Any non-compact group cannot have a faithful finite dimen- 
sional unitary representation. (A faithful representation denotes a 
one-to-one correspondence between the group elements and its repre- 
sentation, namely, for two distinct elements g,g' € G with g # g’, the 
corresponding representations U(g),U(g') must satisfy U(g) 4 U(g') 
in order to provide a faithful representation.) 


Proof. Suppose that G is a non-compact group and that it possesses a 
finite dimensional nontrivial unitary matrix representation U(G) ina 
vector space V. As we already know (see (1.51)) any matrix element 
of a finite dimensional unitary matrix U(g) satisfies |(U(g))ij| < 1. 
Therefore, the group manifold of finite dimensional unitary matrices 
is bounded (and also closed), i.e., it corresponds to a compact space. 
Since for a faithful representation, G and U(G) are one-to-one, if 
U(G) denote finite dimensional unitary matrices, this would imply 
that a non-compact group G can have a continuous one-to-one map 
on to a compact space which is impossible. 

= 


> Example (Infinite dimensional unitary representation of T1). In section 2.1, we 
constructed (see the example) finite dimensional representations of the non-compact 
translation group in one dimension which were not unitary. We also note here that 
although the (infinite dimensional) representation in (2.72) seems to be formally 
unitary, questions such as Hermiticity, unitarity etc. cannot be discussed until a 
suitable space of functions has been specified. Let us construct here an infinite 
dimensional unitary representation of the one dimensional translation group 7}. 
Let |f),|g) denote states in a Hilbert space with the inner product defined 


by 
(flo) = > de (f|x)(a|g) = / dx f*(x)g(«), (2.112) 


where f(x),g(x) denote (in general, complex) functions of the one dimensional 
coordinate x spanning the real axis (—oo < x < oo). The Hilbert space is defined 
by the set of all L2 (Lebesgue square integrable) functions f(x) satisfying 


Co 


ll? = IF) = | da f"(x) f(a) < 00. (2.113) 


—oo 
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In such a space, the action of the one dimensional translation T; on any function 
f(a) € Le can be defined by 


f(w) s fle +a) € Le, (2.114) 


for any real constant a (—co < a < oo). It follows now that 


Wa fU(ag) = f dxf erag(e+a)= f def (oglx), 
or, (U(a)f|U(a)g9) = (flg)- (2.115) 
Since this holds for any two arbitrary L2 functions f(x), g(x), it follows that 
U'(a)U(a) = 1, (2.116) 


so that U(a) provides a unitary representation of the translation group 7}. 
Let us note that the set of functions (see (2.48)) 
Ex(z)=2", n=0,1,2,---, (2.117) 


do not define elements of the Lz space since 
(Enlén) = ‘| da a?” + oo. (2.118) 
In order to find a suitable Lz basis functions (not normalized), let us consider the 


functions defined in (2.118) with an appropriate weight (damping factor), namely, 
let us define 


gla) sae 2” See 2”) we Oa, (2.119) 
It is clear now that 
i da wy, (x)Un (x) = / da x?” = T(n+ 5) < 00, (2.120) 


always exists so that qn(a) € L2. Furthermore, under a finite translation x > r+a 
(see (2.47)) 


thn (2) Ee), Wn(x a) a (x £ a)” ew gata)? _ (a . a)"e 2% —8 eat 
= ¥> Cmn(a)a™ e~4*" = S~ Cn (a)m(2), (2.121) 
m=0 m=0 


which shows that wn(x +a) € Le. The constants Cmn can, in principle, be 
detrmined by Taylor expanding the function and give an infinite dimensional 
representation of T;. In this case, it is straightforward to check that the operator 
d 
D=—, 2.122 
a (2.122) 
is anti-Hermitian in this space, namely, for any two functions f(z), g(x) € Le, 


/ de (Df(2))*g(«) = — i de f*(«)(Da(x)), (2.123) 


—oo 
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which follows from the boundary conditions f(x), g(a) — 0 as |z| — oo (see 
(2.119)). As a result, the representation of the translation group in (2.72) 


U(a) = e*? = e8e, (2.124) 


is unitary in this space of functions. 

However, in order to calculate explicitly the unitary matrix, we need to 
change the basis functions. Let us consider the normalized infinite dimensional 
basis functions 


1 
Vr2n! 


where H,(x) denotes the nth order Hermite polynomial and 


bn() =( \F aa(oye n=0,1,2,---, (2.125) 


co 


(Bnlbm) =f dx ,(0)bm(2) = nm (2.126) 


—oo 


As we have seen in (2.121), under a translation, we can write 
U(a) _< 
n(z) —> bn(a +a) = >> Umn(a)om(2), (2.127) 
m=0 


where Umn(a) denote the matrix elements of the infinite dimensional matrix U(a) 
providing a unitary representation for 7). 

To calculate these matrix elements, let us recall that the basis functions 
¢n(x) correspond to the energy eigenfunctions of the one dimensional harmonic 
oscillator (h = m = w = 1), namely, 


; (-4 +2°) oS (n+ 5) dn(c), 


or, aladn(x) = ndn(a), (2.128) 


where we have introduced the annihilation and creation operators (we denote 
them as a,a‘ in order to avoid confusion with the parameter of translation a) 


o=Z (e+), a= (e-4), (2.129) 


satisfying 


[o,a4}=1, [a,a] =0=[al, al]. (2.130) 
The one dimensional harmonic oscillator enters into the discussion of the unitary 
representation of T; in the following way. We note that we can invert the relations 
n (2.129) o write 


eae (atl), + = (a-a'), (2.131) 


so that we can write the operator implementing translations as (see (2.124)) 


a-at) 


U(a) =e%t = eval (2.132) 
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Using the Baker-Cambell-Hausdorff formula (which will be discussed later in sec- 
tion 4.3) 


ete? = eAtB+al4.5) if (A, [A, B]] =0=[B,[A, B]], (2.133) 
we can rewrite 
7 (a at) a_qt aq a? 
U(a) =ev?2 =e v2" ev2°e 4, (2.134) 


where we have used (2.130). 
Let us recall that the energy eigenstates of the harmonic oscillator can be 
written in terms of the creation operators as 


(at)" 
Jn 


where |0) denotes the ground state satisfying a|0) = 0. The corresponding wave 
functions (eigenfunctions) are obtained from the inner product with the coordinate 
basis as 


In) = 


|0), n=0,1,2,--- ) (2.135) 


on(x) = (2|n). (2.136) 
It follows that the translated wave function can be written as 

bn(a +a) = (a +aln) = (e|U*(a)In) = (e|U(—a)|n), (2.137) 
where we have used the fact that U(a) is unitary so that (see (2.134)) 


a2 a qi 


Ut(a) =U(-a) =e F eV3™ & V8", (2.138) 


Using the standard properties of the annihilation and creation operators, it follows 
that 


m=0 


2 aman | 4)" a im), (2.139) 


where we have let m > n — m in the last step. It follows now that 


UO = —— (-%) ~ an 8°" |m) 


ee n_m Vinl)\(k+m)! (a \"-™** 
cual DP Pee les eee eC (+5) ate 
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co min(n,£) n—-2m-+l 
=e a2 _y\yn-m a (n!) (2!) 
7 » m=0 my (<5) Gane ao 


Here we have redefined k = £— m™ in the final step and have used the fact that 


1 
(€—m)! 


We have also identified 


=0, form>+1. (2.141) 


(2.142) 


where “min(n, @)” stands for the smaller of n and @. Taking the inner product 
with (|, it follows now from (2.127), (2.137) and (2.140) that 


on(% +a) = S~ Uin(a)de(2), (2.143) 


£=0 


where the constants Ue,(a) representing the matrix elements of the translation 
operator are given in (2.142). We note that the annihilation and creation opera- 
tors of the harmonic oscillator also manifest in the ray representation of the two 
dimensional translation group T2 as we will discuss later in section 4.4. 

< 


2.3 Group integration 


In section 1.3 we introduced the concept of a group manifold which 
allowed us to classify continuous (Lie) groups into compact and non- 
compact groups. For compact groups, we can also define the notion 
of a volume integral on this manifold which is invariant under the 
group action and satisfies the following relations. 


1. The volume of the group manifold is finite 


[es = < oo, (2.144) 
G 


where (2 denotes the volume of the group manifold. 


2. For any fixed element go € G, we can make the change of vari- 
ables 


97909, Of, 9990, (2.145) 
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such that 


d(go9) = d(g90) = dg, (2.146) 


namely, the integration measure is invariant under the action 
of the group. 


3. The volume integration also satisfies 


d(g~') = dg. (2.147) 


An integration measure satisfying (2.144)-(2.147) is known as 
the Haar measure in the theory of Lie groups. For non-compact 
groups such a measure may not exist. However, if a non-compact 
group is locally compact (in the sense of a topological space) we may 
define a group integration as defined above, but the volume of the 
group manifold would no longer be finite, namely, 


Jus == 00, (2.148) 
G 


in contrast to the compact case. For example, for translations in 
one dimension, 7, whose group manifold is locally compact, has an 
infinite volume as we will show in (2.165). 


> Example (Unitary groups). As we have discussed in section 1.3, unitary groups 
are compact groups. Let us begin with the example of the one dimensional uni- 
tary group U(1) (also denoted by U1). As we have discussed earlier, the group 
manifold in this case corresponds to the unit circle and every group element can 
be represented on this manifold as 


g7 U(6) =e", 0<O< 2x. (2.149) 


Therefore, every group element corresponds to a unique angle @ on this manifold 
(the unit circle) and we can identify (we emphasize here that dg is an abstract 
notation for the integration measure and does not represent the differential of 
g, more rigorously we should denote the measure as ju(dg) which we do not for 
simplicity) 


dg — dé, (2.150) 
which leads to the following. 


1. It follows from (2.150) that 


2a 
[es = ik = 2m < 00. (2.151) 
G 0 
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2. Any fixed group element go corresponds to a fixed angle 09 on this manifold 
and we have 


990 > ee = ei(9+80) (2.152) 


so that the product ggo = gog corresponds to the unique angle 0 + 05 on 
the group manifold and the group action corresponds to a translation of the 
angle 0 by a fixed 69. As a result, we have 


d(ggo) = d(gog) = (0 + 90) = dé = dg. (2.153) 
3. We note from (2.149) that 
g* + (U(6))* = U(-6) =e, (2.154) 


so that the inverse element corresponds to the point (—@) on the unit circle 
and the integration measure remains invariant, namely, 


dg ' = dg. (2.055) 


We have also noted in section 1.3 that the the group manifold for U(N), the 
unitary group in N-dimensions, can be identified with the subspace of the unit 
ball in N? dimensions which can be parameterized by N? complex parameters 
aij,t,j =1,2,--- ,N satisfying (see (1.130)) 


lou] <1, (2.156) 


such that the unitarity condition is satisfied. (Basically, we can parameterize the 
unit ball by N? real parameters, say, Re aij.) In this case, therefore, we can 
identify 

dg = [J d(Reai;)d(Im aij) 6(U" (g)U(g) — 1). (2.157) 


ij=l1 


Since the coordinates of the unit ball are bounded as in (2.156), it follows that 


fas Ae, (2.158) 
G 


< 


> Example (SO(3) group). The group manifold of 3-dimensional rotation group 
SO(3) or R(3) may be of some interest to the readers. As discussed in section 1.2.5 
(see, for example, (1.74)-(1.78) and the subsequent discussion) we may identify 
the group action with a rotation of coordinates in a 3-dimensional Euclidean space 
by an angle @ around a fixed 3-dimensional unit vector n 


n=(ni,no,n3), mn? =n? +nz3+n3=1, (2.159) 


which can be represented as (n, 0) as shown in Fig. 2.1. However, since (n, 6) and 
(—n, 27 — 0) correspond to the same rotation, we need to restrict the range of the 
angle of rotation 0 to 0 < 6 < a. Therefore, we see that the group manifold of 
R(3) can be identified with a three dimensional space consisting of all vectors of 
the form r = 0n with 0 < 6 < 7a. Since n represents a unit vector, it follows that 


2 = r’ = 0”, (2.160) 
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O 


Figure 2.1: A rotation by an angle @ around a unit vector n in the 3-dimensional 
Euclidean space with O denoting the origin of the coordinate system. 


so that the group manifold corresponds to the interior of a three dimensional 
sphere of radius 7. In spherical coordinates, we can parameterize the components 
of the unit vector n as 


n = (ni, N2,n3) = (sinacos Z,sinasin B, cosa), (2.161) 


where 0 <a<a7and0< 6 < 27. With this the group invariant volume element 
can be written as (recall that in spherical coordinates d?a = r?dr sin 0d@d¢) 


dg = 07d0 sinadad§, (2.162) 


where, as we have noted earlier, 0 < 6 < 7m. (In this case, [dg = oe 


However, the group manifold for R(3) has the following subtlety. We note 
that (n,7) and (—n,7) represent the same rotation. Therefore, we must identify 
the diametrically opposite points on the surface of the sphere (namely, the two 
points representing the unit vectors n and —n on the surface of the sphere). This 
leads to the fact that the group manifold for rotations is not simply connected. 
This is the origin of the spinor representations which are double-valued represen- 
tations of R(3) (or SO(3)). In contrast to R(N), the group manifolds of U(N) 
and SL(N) are known to be simply connected (note that SL(N) is a non-compact 
group, see the discussion after (1.132)). 

< 


Let us also note here that any finite group, such as the permuta- 
tion group Sy (see section 1.2.1), can always be regarded as compact 
with the group integration 


| dg = (2.163) 
X 


The group invariant volume, in this case, basically counts the total 
number of independent elements in the group and since the finite 
groups have a finite number of elements, the group invariant volume 
is finite. 

In contrast to these examples, we note that the group manifold 
for the one dimensional translation group T{ is the real line. Every 
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group element can be uniquely specified on this manifold by a point 
“a” denoting the parameter of translation. Consequently, we can 
write 


dg = da, (2.164) 


so that the group volume follows to be 


us i j da = 00; (2.165) 
G —oo 


in spite of the fact that the real line is a locally compact manifold (as 
we have mentioned earlier). 


2.4 Peter-Weyl theorem 


One important difference between the compact and non-compact 
groups is given by the Peter-Weyl theorem (named after Hermann 
Weyl and his student Fritz Peter) which essentially implies the fol- 
lowing. First, for a compact group, any of its representations is always 
finite dimensional, fully reducible and equivalent to a unitary repre- 
sentation. In contrast, for a non-compact group, its representations 
may not be fully reducible and the only two unitary representations of 
the group are either the one dimensional trivial representation or an 
infinite dimensional representation as we have already noted in con- 
nection with the example of the one dimensional translation group 
T, earlier. Let us prove some of these assertions below. 


2.4.1 Fully reducible representation. Let us change notation (for a rea- 
son to be clear soon) and denote the finite dimensional (n xn) matrix 
representation of a (compact) group element by U(g) instead of U(a) 
as we have done earlier. Let us assume that U(g) is reducible and 
write it in the form (see (2.90)) 


U(g) = ee Ae , (2.166) 


Let us identify 


W(g) = Gy re) (2.167) 
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Since U;(g) and U2(g) correspond to representations of G, as we have 
already seen in (2.92), it follows that 


W(91)W (92) = W(g192). (2.168) 

Let us next define 

S= Jos W(g"')U(g), (2.169) 
res 


where the integral exists since W(g~+) and U(g) are finite dimensional 
matrices. For any fixed element go € G, it follows from (2.169) that 
(the reason for changing notation for the group element is to take 
advantage of the group integration property below) 


W(95")SU (go) = / dg W(952)W(g-})U(@)U (90) 
G 
= [es W (99 '9-')U (gg) 
G 


= f a(ago) W((990)-")U (990) 
G 


= / dg W((g')-1)U(g!) = 8, (2.170) 
G 
which leads to 
W (99 ')SU(g0) = S, or, SU(go) = W(g0)S. (2.171) 


Since go is any arbitrary element of the group, (2.171) can be written 
in general as 


SU(g) = W(g)S. (2.172) 


Let us note that 


W(g"!)U(g) = Ce 0 eae Hoy 


(2.173) 
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where E,,, and E,, are respectively ny Xn and nz X nz unit matrices 
(with nj + ng =n). Therefore, we can write 


QED K 
= -1 = 
s= fasworwe=("% age): (2.174) 
G 
where we have identified 
Y= [os k= funn. (2.175) 
G G 
It follows from (2.174) that 
det. Ss = OP SO eG, (2.176) 


so that S~! exists and we can rewrite (2.172) as (see also (2.167)) 


SU(g)S7 = W(g) = @s a) (2.177) 


This proves that U(g) is fully reducible since if Uj(g) and/or U2(g) 
are reducible, we can follow the same procedure for U;(g) and U2(g) 
as well. 


2.4.2 Unitary representation. Let us next show that any (finite dimen- 
sional, n x n) matrix representation U(g) of such a (compact) group 
is equivalent to a fully reducible unitary representation. To prove 


this, let us define 


T= | dg Ut (g)U(g). (2.178) 
G 


Since UT(g)U(g) is a Hermitian and positive matrix (namely, all the 
eigenvalues are positive), so is T. Let us next note that for any fixed 
element of the group, go € G, 


U'(g0)TU (go) = / dg U* (go)U*(g)U(g)U (go) 


G 
= f ag (U(aU(@o))'Ua90) 
G 


- / due anu Gan) 7. (2.179) 
G 
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Since go is arbitrary, we can write the relation (2.179) in general for 
any element of the group as 


U'(g)TU(g) =T. (2.180) 


As we have seen, T is a Hermitian and positive n x n matrix so 
that we can write it as 


= sis, (2.181) 


for some nonsingular (namely, S~! exists) n x n matrix 9. Therefore, 
it follows from (2.180) that 


) 
U'(g)S'SU(g) = S'S, 
or, (SU(g)S~1)' (SU(g)S7!) = En. (2.182) 
Therefore, if we define 
W(g) = SU(g)S“*, (2.183) 
we can write (2.182) as 
W'(g)W(g) = Ens (2.184) 


namely, W(g) isan xn unitary matrix. This proves that the repre- 
sentation is equivalent to a unitary representation, and furthermore 
as we have shown earlier this is also fully reducible. 


2.5 Orthogonality relations 


Let U(g) and U)(g) denote two ny x ny and ng x ng unitary 
irreducible matrix representations of a compact group G. Then we 
can prove the following relations in a straightforward manner. 


1. If UY (g) and U®)(g) are inequivalent representations, then 
i dg (ui(9)) U2) (g) =0, (2.185) 
G 


for all 1,9 =1,2,--- ny and u,v = 1,2,--- ,N2. 


2. For ny = nz and U“)(g) = U)(g) (equivalent representations), 
we have 


feo ( dg (UP) ) oP) = Minds, (2.186) 
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for i,j,k,@=1,2,--- ,n, and the constant A is given by 


recs (2.187) 


ny 


where 2 denotes the volume of the group manifold defined in 
(2.175). We note that for a finite group G such as the symmet- 
ric group S;,, we replace the integration over the group man- 
ifold J dg by the summation (over all group elements) )> as 
gEG 
in (2.163) so that the volume 2 corresponds to the total num- 
ber of independent elements of G. Thus, for example, we have 
Q = N! for the group Sy. 


To prove these statements, let MZ denote an arbitrary n1 x n2 
constant matrix and define another n; x nz (constant) matrix T by 


_ / dg UO (g-) MU (q). (2.188) 


Then, for any fixed element go € G, we have 


UY (gg)TU™ (go) = / dgU (g9t)U (g-!) MU (g)U@) (go) 
= _ (go ‘9 1) MU (ggo) 
= d(ggo) U™ ((ggo)~1) MU (go) 


ars U4 ((g')-1)MU)(g') = T. (2.189) 


Since go is an arbitrary element of the group, we can also write (2.189) 
in general as 


UY (g)TU (g) =T, 
or, UY(g)\T = TU) (4g). (2.190) 


It is clear now from Schur’s lemma (see (2.102)) that if U)(g) and 
U?)(g) denote two inequivalent and irreducible representations, then 


T =0, (2.191) 
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which leads to 
i dg UO (g-) MU (g es dg MyU?(g)U2(g) = 0, 
(2.192) 
where summation over repeated indices is assumed and 7, 7 = 1,2,--- , nq 


while p,v = 1,2,--- ,ng. On the other hand, the matrix M is an ar- 
bitrary n, X ng matrix so that for (2.192) to hold, we must have 


[es Us) (9 UD (9) = 0, ee) 
G 


for any i,j = 1,2,---,n, and p,v = 1,2,---,ng. Since UO (g) is 
unitary, it follows that 


or, UY) (gt) = (uP@) (2.194) 


Therefore, (2.193) can be written as 


feo(e Ug )) U2 @) = 0, (2.195) 


which proves (2.185). 

On the other hand, when n; = nz and U“)(g) = U®)(g) (equiv- 
alent representations), we note again from Schur’s lemma (2.103) that 
for 


UY (g)T = TU (g) (2.196) 


to hold we must have T = \E;,, so that we have from (2.188) 


/ dgU(g-!) MU (9) = XBny. (2.197) 


Taking the trace of both sides of (2.197) and using Tr E,, = ni as 
well as 


Tru) (9-1) MU) (g) = Tru (g)U™ (g-!)M = TrU(e)M 
=TrE,,M =TrM, (2.198) 
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we determine 


1 1 
A=—TrM | dg=—OTrM. (2.199) 
NY NY 
G 


Substituting this into (2.197) we obtain 


1 
il dgU (g-1)MUY = — NE, TrM, 
nm 
a 1 
1 
or, / dg MU) (gt) UP = — 06 je TrM. (2.200) 
1 
G 


Since M is an arbitrary n; x n, constant matrix, using (2.194) we 
conclude that 


* 1 
Ju (uo) Uy (9) = wy Siw bie (2.201) 
G 


which proves (2.186). 


2.6 Character of a representation 


Let U(g) beanxn matrix representation (not necessarily irreducible) 
of a compact group G. We call 


x(g) = TrU(9), (2.202) 


to be the character of the representation. Since U(e) = Ey, we obtain 
from (2.202) that 


yey TF U (ey STE = ti (2.203) 


where n denotes the dimension of the representation space. 
If U(g) is fully reducible and is a direct sum of irreducible repre- 
sentation matrices U\(g), U@)(g),--» ,U'(g), namely, (see (2.97)) 


S-1U(g)S = e. , (2.204) 
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then taking the trace of both sides of (2.204) and noting that 
Tr (S~'U(g)S) = TrU(g) = x(9), (2.205) 


we obtain 
Nk 
x(g) = >> xal9), (2.206) 
a=1 


where Ya(g) denotes the character of the a-th irreducible representa- 
tion U((g). 

Since Tr (S~'U(g)S) = TrU(q), the character of a represen- 
tation is independent of the choice of a particular equivalent repre- 
sentation. Moreover, the number of all possible irreducible repre- 
sentations of a compact group is known to be at most denumerable. 
Hence we may denote the characters of the (irreducible) submatrices 
as x1(g);X2(9),x3(g),-+* in such a way that va(g) # xo(g) if U(g) 
and U() (g) are inequivalent representations. In the orthogonality 
relations for U(g) and U(g) in (2.185)-(2.187), if we set i = 7 
and = v and sum over 7 as well as yz, we obtain the orthogonality 
relation for the characters to be 


/ dg x2(g)xs(9) = Sap. (2.207) 
G 


where, for irreducible representations U( (g) and U“)(g), 


1, if U\)(g) and U\(g) are equivalent, 
ab = - 77a) (b) : . (2.208) 
0, if U'Y(g) and U\)(g) are inequivalent. 


The characters of irreducible representations of classical groups such 
as U(N), SU(N), SO(N) as well as the symmetric group have been 
computed by H. Weyl. 
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CHAPTER 3 


Lie algebras 


3.1 Definition of a Lie algebra 


Let us start with a the formal definition of a Lie algebra L as follows. 


(L1) 


(L3) : 


: The Lie algebra LD is a vector space over a field F' which, in 
our discussions, will be assumed to be the field of either real or 
complex numbers. In other words, for any x,y € L and for any 
real or complex numbers a, 3,A, the product ax € L and the 
sum «x+y € L are defined and satisfy the conditions 


a(Bx) =(aB)x, ax + By) = (Aa)a + (AB)y. (3.1) 


: For any two elements of the Lie algebra, a bilinear product 
[x,y] € L is defined in L satisfying the anti-symmetry condition 


[2,y] = ly, a]. (3.2) 
Here by bilinear product, we imply the validity of the condition 

lax + By, 2] = ala, 2] + Bly, 2], (3.3) 
for any a, 8 € F and for any 2, y,z € LD. 


Finally, for any three elements xz, y, z € L, the bilinear product 
is assumed to satisfy the Jacobi identity 


[x,y], 2] + lly, 21,2] + [lz, 2], y] = 0. (3.4) 


Any vector space L satisfying (11)-(Z3) is known as a Lie al- 


gebra. If the field F is real or complex, then L is correspondingly 
called a real or a complex Lie algebra. Unless otherwise stated, we 
will assume that D is a complex Lie algebra in all our discussions. 
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(L3’) : We note here that the condition (£3) can also be written in 
the equivalent form 


[x, [y, 2]] + ly, lz, el] + [z, fe, yl] = 0, (3.5) 


which can be seen as follows. First, using the anti-symmetry 
property of the bilinear product (3.2), we can write 


[[z, 9], 2] = —[z,[z, yl], [y, [z, e]] = -[lz, 2], 9], 

[z, [z, y]] = —[lz, y], 2], (3.6) 
and substituting this into (3.4) we obtain 

[[z, y], 2] + (ly, 2], 2] + [[z, 2], y] 


= —[z, le, 9]] — la, ly, 21] — [y, [z, 2] 
= — (@, [y 2l] + ly, fz, el] + [z, fe yl) = 9, (3.7) 


which coincides with (3.5). 
We note that if 
[x,y] =0, forall z,yeL, (3.8) 


it corresponds to a Lie algebra which is called Abelian. A Lie al- 
gebra is called non-Abelian otherwise. It is worth emphasizing here 
that [x,y] is an abstract bilinear product and need not necessarily 
correspond to a commutator 


[x,y] = zy — ya, (3.9) 


since the product xy may not itself be defined in this context. How- 
ever, if A defines an associative algebra such that for x,y,z ¢ A 


(xy)z = x(yz) = xyz, (3.10) 


then the commutator algebra L = [A, A] is a Lie algebra defined by 
the bilinear product [x,y] = xy — yx. We note that in this case we 
have [x,y] = z € L and we can write 


[[z, y], 2] = [z, ylz — 2[z, y] = (wy — yx)z — 2(xy — yx) 


ryz — yxz — zxyt zye, (3.11) 
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which leads to 


[x.y], 2] + (ly, 2], 2] + [[z, 2], 9] 
= LYZ — YXZ — ZXY + ZYX + YX — LyX — LyZ+ ULZy 
+ zry —xzy —yzxn+yxz =0, (3.12) 
so that the Jacobi identity is satisfied. 
For a Lie algebra L, let us suppose that we can find nxn matrices 


(or more generally, linear operators in a Hilbert space) p(x) for all 
x € L (recall that matrices define a vector space) satisfying 


(i) p(x) is linear in x, namely, 


plax + By) = ap(x) + Bply), (3.13) 
for any x,y € L and for any real or complex numbers a, £. 


(it) p(x) also satisfies 
pla, yl) = lo), py] = p@)e(y) — py)pl@). (3.14) 


We note that since 


[p(x), p(y)] = p(x) ely) — ply)e(2), (3.15) 


p(x) generates a Lie algebra because the matrix product p(x) p(y) is 
associative. In this case, we say that p(x) provides a (linear) n x n 
matrix representation of the Lie algebra L. (It is worth pointing 
out here that even though the matrix product is associative, in gen- 
eral, there is no guarantee that [p(x), p(y)] = p(z), for some z € L. 
However, by our assumption (3.14), if [z,y] = z € L, it follows that 
p([x,y]) = plz) = [e(2), p(y)] € £ and p(x) satisfying (3.13) and 
(3.14) defines a Lie algebra.) 

A familiar example of a matrix representation of a Lie algebra 
in quantum mechanics is given by the spin matrices 


1 
es ee 1, 2,3, (3.16) 


where o; denote the three Pauli matrices 


0 1 O —2 1 0 
a=(3 a oe = (\ a v= (5 ae (3:17) 
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which satisfy the angular momentum (Lie) algebra 
[S1, S2| = 183, [s, $3] = 181, (53, s1| = 182. (3.18) 


Here 1 = /—1 and the algebra (3.18) can be written in the more 
compact form 


Si, $5] = 1ejR Se, (3.19) 


where the subscripts 7, 7,k = 1,2,3 and €;;, represents the three di- 
mensional Levi-Civita tensor which is completely anti-symmetric with 
(93 = 1. 


3.2 Examples of commonly used Lie algebras in physics 


In this section, we describe some of the more common examples of 
Lie algebras in physics as well as their representations. 


3.2.1 Lie algebra of gi(V) and si(N). Let x,y,z denote any three N x 
N matrices and we define the bilinear product of any pair of them 
[x,y] to correspond to their matrix commutator, namely, 


[x,y] = zy — yx, (3.20) 


so that we have a Lie algebra. Such a Lie algebra without any restric- 
tions on the matrices is known as gl(N) (gl stands for general linear) 
since it is intrinsically related to the general linear group GL(N) as 
we will explore later. Furthermore, (3.20) gives a N-dimensional ma- 
trix representation of gl(N) known as the defining representation. 
(There can be several possible representations of a Lie algebra, not 
just the defining representation, corresponding to other possible n x n 
matrix realizations as we will see and as we know from our studies 
on the representations of the angular momentum algebra.) 

If we restrict the matrices x,y,z further to be traceless N x N 
matrices, 


ee = Tey lr 0, (3.21) 


then, all such matrices also define a Lie algebra. We note that for any 
two finite dimensional matrices x,y, the trace of their commutator 
vanishes, namely, 


Tr [x,y] = 0, (3.22) 
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because of the cyclicity of trace, so that [x,y] also belongs to the Lie 
algebra (it is a traceless N x N matrix). The Lie algebra of traceless 
general N x N matrices defines the special linear Lie algebra sl(NV) 
which is related to the special linear Lie group SL(N) (see section 
1.2.4). 


3.2.2 Lie algebra of so(N). Let us define a set of second rank anti- 
symmetric tensor operators in a Hilbert space 


Kup = Avis. “Pe SZ Ne (3.23) 


which satisfy the commutation relations (wu, v,a, 3 = 1,2,--- ,N and 


i=/—1) 
[Xp Xa] = XywXop — XapXp 
= 1 (OpuaXvp + bv6X pa _ bupXve. _ dvaX pp) . (3.24) 


This defines a Lie algebra known as the so(NV) Lie algebra and is 
related to the Lie group SO(V) and the operators X,,, are known as 
the generators of the Lie algebra so(N). Because of anti-symmetry, 
there are only 4. N(N — 1) such operators. (We note here that the 
Lie algebra is defined up to a scaling of the operators X,,, and here 
we have chosen the operators to be Hermitian.) 

A familiar example of so(N) can be given as follows. Let (4, 5,.) 
with p,v =1,2,--- , N denote the canonical coordinate and momen- 
tum operators in N-dimensional quantum mechanics so that we have 
the canonical commutation relations (we assume fi = 1) 


[Bas Purl 7 =[pr tpl = Wu, 
[eu,%)] = 0 = [Dy br], (3.25) 
which can be realized in the coordinate representation as p,, = —igk. 


If we now define 
Luy = fpPv — LyPp = —Lp, (3.26) 
then it can be checked, using (3.25), in a straightforward manner that 
[Luv Lagl =4 (SpoLve + bp Lya is bupLva _ dvoLyp) F (3.27) 


so that it satisfies the so(N) Lie algebra (see (3.24)). Ly, can be 
thought of as the angular momentum operators generating rotations 
in the p-v plane. 


68 3 LIE ALGEBRAS 


In quantum mechanics, (%,,, ),,) are Hermitian operators, namely, 
al, = 2,,p1, = By, so that 


E,, = (Br)! Bs)! — (6)! (GL)! = By — Buby 
= Dip tp = Lis (3.28) 
namely, they are Hermitian. 
For N = 3, for example, we recognize that there are only three 


independent angular momentum (L,,) operators which we can label 
as 


Ly = Lo3 = —L32, Le = L3, = —L13, L3 = Lyg = —L1. (3.29) 
It now follows from (3.27) that 
[Li, Lo] =iL3, [Le,L3]) =ik1, [L3, Li] = iLe, (3.30) 


which we recognize to be the familiar angular momentum algebra (see 
also (3.19)) 


[Li, L5] _ 1eigk Le, C55 -Ke 7 1, 2,3, (3.31) 


with L' = L;, i=1,2,3. 

We can now construct some finite dimensional non-spinorial rep- 
resentations for the so(NV) Lie algebra as follows. Let us first consider 
the case when the operators L,,, act on constants C’. From the fact 
that py,C = —ige = 0, we conclude from the definition (3.26) that 

LwC = 0, (3.32) 


which furnishes the trivial one dimensional representation of so(N) 
corresponding to the zero angular momentum state ¢ = 0 in the case 
N =3. 

When L,, acts on a linear monomial x), then we have p,7), = 


ig = —16, so that 
Ly > (fpPv = DyPaycr = —4(dyy ty — Ox ps (3.33) 


and x) lead to the N-dimensional representation for L,,,. In general, 
finding a vector space on which the operators act leads to a repre- 
sentation for them in the following way. In this particular case, we 
note that x, = (%1,%2,-:- ,#Nn) represents the coordinate vector of 
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the N-dimensional vector space Vj. Since the coordinates x) define 
a complete basis in this space, we can expand 


N 
Lyy& = dX UB (Zw) Br’ (3.34) 


and comparing with (3.33) we obtain the N x N matrix representation 
for the matrix Ly, = p(Lyv) as 


(Zw) op =-1 (Oa Oup = bv05 np) ) (3.35) 


for w,v,a, 8 = 1,2,--- , N and it can be checked that it satisfies (3.27) 
with the standard matrix products, namely, 


ees Ly,| =1 (,rLup + Os Lie _ baka _ bv»Lyp) ; (3.36) 


This gives the N dimensional (N x N matrix) representation of so(V), 
namely, the defining representation. For N = 3, for example, we can 
determine (using the identification in (3.29)) 


7 0 0 0 7 0 0 -1 

Iy=-i{0 0 1], In=-if0 0 0], 
Oh i 10 0 
0 10 

I3=-i[-1 0 Oo], (3.37) 
0 0 0 


which are manifestly Hermitian and purely imaginary and, as we 
know from quantum mechanics, provide the 3x3 matrix reprsentation 
corresponding to the angular momentum value ¢ = 1. 

Similarly, let us consider a symmetric second rank tensor of the 
form 


Dyp = XU)Up = ob px (3.38) 


From the fact that p,Ty, = —j ete = —i(dnr,Fp + duptr), We can 


now directly evaluate and check that 


LuvTy =1 (Gu tve “F Ovpt pr _ Ouplvr _ dvr»T yp) ’ (3.39) 


so that T,,, = T,, in (3.38) defines a (matrix) vector space on which 
the operators L,,, act and leads to a matrix representation of dimen- 
sion $N(N +1) for the operators (as in (3.34)-(3.35)). However, such 
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a representation is not irreducible which can be seen as follows. If we 
set 


fe (3.40) 
where repeated indices are summed, it follows from (3.39) that 
Lgl =0, (3.41) 


and T corresponds to the trivial one dimensional representation (see 
also (3.32)). As a result, we can construct the traceless symmetric 
tensor (repeated indices are summed) 


1 
Syp = Trp — W Ol = Spas Sy, = 0 (3.42) 
and it follows from (3.39) and (3.41) that it also satisfies 
Ly Srp = t(OurSvp + dupSpr — SupSvr— dyrS up) ; (3.43) 


Th action of the operators L,,, does not take a vector out of the vector 
space defined by S),. Therefore, Sy, also leads to a representation of 
so(N) which is }N(N +1) —1 dimensional. Noting from (3.42) that 
Tigo x d\p1, we conclude that the 53 N(N + 1) dimensional 
representation of so(V) generated by Ty, is fully reducible since the 
vector space T) is a direct sum of 5), and T (see also the discus- 
sion around (2.98)-(2.101)). For the case of N = 3, the traceless 
symmetric tensor S), corresponds to the angular momentum state of 
¢ = 2. This is easily seen in the spherical coordinates (r, 6, ¢) instead 
of Cartesian coordinates (7 ,2%2,x73). For example, in the spherical 
coordinates, we can write 


x1 =rsin9@cos @, 


t2 =rsin@sin d, 


x3 = rcos#, (3.44) 
leading to 
te =a, tixo =rsinde*"?. (3.45) 


We know that for @ = 2, the azimuthal quantum number takes the 
values m = +2,+1,0. The spherical harmonics which are known 
to provide a representation for the angular momentum states define 
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a 5-dimensional space in this case and can be expressed as (with a 
suitable normalization) 


1 al 
15 \2 , 49; 1 15 \2 
Y2,42(0, 6) = (=) sin? Oe*?"? — 2 (=) LLE+, 


i J 
Y241(6, ~) =F (=) : sin 6 cos bex'? a ns ( 2 ) : ($0423) ’ 
1 


Ya0(0.6) = ( Z ) (Geos? = 1) 


167 


1 — 
1 15 \2 /2 
= 2 (sx) 2 (203 nee ie (3.46) 


3.2.3 Lie algebras of u(N) and su(N). Let a, and al, denote respec- 
tively the annihilation and creation operators of a N-dimensional 
harmonic oscillator satisfying 


[au, ay] =h= [a},, al], [ep, al] — On (3.47) 
for u,v = 1,2,---,N. Here and in the following discussions we use 


the notation that a “super” index corresponds to that of a creation 
operator and define, 


Gi ap. (3.48) 


It can be checked easily using (3.47) that 


Bees al] — [afiay, ai] = a} lay, a] = ba 3 
[Xs gl = [a}ay, da] — [a},, dalay = —dhay. (3.49) 


Using this, it follows that 
[X%,, X%] = [X4,, alag] 
= [X4,, ah]ag + a) [X%,, ag] 
= doal.ag — Shah ay 
= Op X's - 5RX4, (3.50) 


for w,v,a, 8 =1,2,---,N. Any set of linear operators (or matrices) 
X*‘, satisfying the commutation relations (3.50) defines the Lie alge- 
bra u(NV) which is connected to the Lie group U(N). The operators 
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X*, are known as the generators of the Lie algebra u(N). From the 
definition in (3.48) we note the Hermiticity condition 


(XE (3.51) 


Let us note that the operator (the repeated index a is summed 
over a = 1,2,--- ,N) 


X = X%,, (3.52) 


commutes with X“, for all u,v = 1,2,---,N. Namely, using (3.50) 
we have 


[XX] = [XX] 
= §%XM — §H#Xo = XH — XH = 0, (3.53) 


The operator X is an example of a Casimir invariant (named after 
the Dutch physicist Hendrik Casimir) of u(NV) Lie algebra which will 
be explained in more detail later in this chapter (as well as in chapter 
10). 

Let us next define an operator 


1 
AM = XH — — 6H X, 54 
v v— Fy ov (3.54) 
Then it follows that 


(i) the trace of A“, vanishes 


1 
AM, = Xt, — GNX =X-X=0. (3.55) 


(ii) the operator A‘, defined in (3.54) also satisfies the u(N) Lie 
algebra, namely, 


[aA 3] = Bae _ wv Xs XB — w 8 ] 
= [X4,, X%] = 62x", — 54x 
= jeAH Ge Ae, (3.56) 
where we have used (3.53) as well as the definition in (3.54). 


Any set of linear operators satisfying (3.55) and (3.56) defines the 
su(N) Lie algebra which is related to the Lie group SU(N). 
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The special case of N = 2 is of some interest in physics. If we 
identify (recall that, in this case, we have A!, + A, = 0 by definition 
(3.55)) 


1 


J3 = 3 (A‘, — A’g) = Al, = -A%, 
J, = Aly, 
J. = A?, = (Al), (3.57) 


where we have used (3.51) as well as (3.55), then it is easy to check 
using the algebra in (3.56) that 


[J3, J] = tJz, [J4 JJ. | = 2J3. (3.58) 
Redefining 
Jp = SJ £tJ2, (3.59) 


we can rewrite the commutation relations in (3.58) in the compact 
form 


[dis Jj] 7 1eijkJk, 1,9; k= ie 2, 3, (3.60) 


which we recognize to be the angular momentum algebra in 3-dimensions 
(see also (3.31)) and it follows from the definitions in (3.57) and (3.59) 
as well as the condition (3.51) that 


(i)' =J;,, i=1,2,3, (3.61) 


so that the generators of the algebra are Hermitian. Equations (3.31) 
and (3.60) show the well known equivalence of the Lie algebras so(3) 
and su(2). 

We can construct matrix representations of u(V) as follows. Let 
us denote by |0) the vacuum state which satisfies 


a,|0) =0, p=1,2,---,N, (3.62) 
and construct a completely symmetric tensor of rank n as 


TAB 2 Hn at. fae al, |0), (3.63) 


where pt; < N, j = 1,2,--- ,n. (Namely, the totally symmetric tensor 


operator al, ah, nee al, acting on vacuum creates a symmetric tensor 


n 
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state TH2"Hn) Using (3.50) as well as (3.62) we have X3|0) = 0 
and hence it follows now that 


n 
De Teen = 5 a a ae aia 
j=l 


AQg THE Hn — ( Sins 5X) Tisai 


n 
= epee ms sensea tn — Tog rata, (3.64) 
j=l 


Therefore, the symmetric tensor T#142""#" provides an irreducible 
matrix representation, for both u(N) and su(N), of dimensionality 


d= N(N+1)(N +2)--(N+n-1) 


See Gas) (3.65) 


~ (N-1)!nl n 


For N = 2, we are dealing with the Lie algebra su(2) ~ so(3) and 
(3.65) yields 
(2+n—1)! (n+1)! 
= — — } 1 _ i. 
= nm ‘oo 00) 


In this case, with the conventional identification d = 27 + 1 where j 


denotes the angular momentum quantum number, the representation 


(3.63) describes the angular momentum states of 7 = a) = = so 
that 7 = 5; Ls 252, --+ corresponding to n = 1,2,3,---. 


Returning to the u(NV) algebra, we note that the tensor 


HipoHn — pt at , 
it = Gu Bs 


--al, |0), (3.67) 
with p1,°--,fn = 1,2,---,N, is totally symmetric in its indices. 
Therefore, we cannot construct any more complicated representa- 
tions from this tensor through the use of the Young tableaux symbol 
[f1; f2,°:- » fn] (Young tableau will be discussed in chapter 5). There- 
fore, in order to construct such representations, let us introduce N? 
independent annihilation (and creation) operators al us (a)t with 
Lv,j,k =1,2,---,N satisfying 


fal), ( 


fa), al) =0 = [(al)' , (al), (3.68) 


T 
a?)'| = 68 
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With these let us construct the mixed tensor 
A\t 
x" = (a?) afd), (3.69) 


where the repeated index (j) is being summed. It is straightforward 
to check using (3.68) that these operators also satisfy the u(V) Lie 
algebra (3.50), namely, 


[X#,, X%] = 60x", — SEXY. (3.70) 


We can now form the more general tensor (state) 


Teale ln a Ch Fang (ali»)' “ee (aij)! 10), (3.71) 


Sis jn=1 


for constant tensor coefficients Cj, j.,...;, which satisfy the Young 
tableau symmetry [f1, fo,--:, fn]. This automatically leads to the 
tensor T#!"# satisfying the Young tableau [f1, fo,--- , fn]. Such 
a representation theory of u(N) has been extensively studied by 
Biedenharn and Louck. An alternative method is to use fermionic 
annihilation and creation operators a, and al, satisfying the anti- 
commutation relations (corresponding to a fermion oscillator system) 


T T — sv 
GyA}, + a, ay, = Oi 


Apdv + aya, =0= ala}, + alal, pi = 1,23 NN, (3.72) 


and define 
= ala. (3.73) 
In this case, the state 


Le ies aa vee al, |0), n<QN, (3.74) 
would provide the completely anti-symmetric (tensor) representation 
of the Lie algebra. However, we would not go into details of such a 
representation here. 

We remark here that if we define an operator (in the the coor- 
dinate representation) as 


* Ooo es alll 6) 0 
Y= Hon, 5 by = 5 (sug + ts] : (3.75) 
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which is anti-Hermitian (remember our convention of upper index 
corresponding to the creation operator) 


Vv T — Vy =; 
(YP HV (AYO av Ht te oN, (3.76) 
also satisfies 
Vy B| _ sB Vy Vy B 
Ya ter See ty (3.77) 


which gives the same u(N) Lie algebra relation as in (3.50) with the 
identification Y,,” = —X ". However, this form of the generator corre- 
sponds to a rather non-compact realization of the g@(N) Lie algebra. 
For example, if we express x, and oom in terms of annihilation and 


creation operators (of a harmonic oscillator system) as 


i O 1 0 
an = Va («, + in) ; a}, = Va («, = i) . (3.78) 


then we obtain from (3.75) that 
1 1 
be = (a + al.) (a, - al.) =5 O7. 


(aur al ay ala, — al, al) (3.79) 


where we have used the commutation relations in (3.47). It follows 
now that Y,,” acting on the vacuum state generates an infinite number 
of states leading to an infinite dimensional representation which is 
unitary for the corresponding Lie group GL(N) (we will discuss the 
connection between the Lie algebra and the Lie group in the next 
chapter). On the other hand, we note that Y,/ in (3.75) operating on 
a finite dimensional tensor 


Typ pin = Puy Lyg*** Cyn (3.80) 


gives a finite dimensional representation of the corresponding Lie 
group GL(N) which is not unitary. 

Finally, returning to the Lie algebra so(3) ~ su(2) (discussed in 
the last section and see also (3.57)-(3.60)), we introduce two sets of 
creation and annihilation operators as 


a=a,, b=ao, (3.81) 


and construct the operators 


1 
I= 5 (ala = oto) , Jz=alh, J_=bdla. (3.82) 
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It can be checked in a straightforward manner that the operators 
satisfy the so(3) ~ su(2) Lie algebra. In this case, the standard 
quantum mechanical state vector (angular momentum eigenstates) 
\j,m),m = j,j —1,---,—j with 7 integer or half integer can be 
realized as 


om) = at)" (r) "Io, (8.83) 


which satisfies the relations 


J3|3,m) = m|j,m), 


Ja|j,m) = /GFm\GEm+tij,m+ 1), (3.84) 


and can be checked to satisfy the orthonormality relation 
(j, mj’, m’) = 653 5mm’- (3.85) 


This construction of the representations of the angular momentum 
algebra is due to J. Schwinger. We note that with respect to the 
inner product (3.85) the generators satisfy the Hermiticity properties 


(J3)' =J3, (J4)' = Je. (3.86) 


In terms of the tensor T¥1"“" defined in (3.71) for the su(2) al- 
gebra, we note from (3.81) as well as (3.83) that this construction 
corresponds to the choice of the constant coefficients in (3.71) to cor- 
respond to 


Ogigwe fee = Cla ped Bone 2: (3.87) 


We also note that if we define J3 and J by (as opposed to (3.82)) 


1 
J3 = D) (ata + btb+ 1) ¢ PS 1a, Je ila, (3.88) 


these generators can be checked to also define the angular momentum 
algebra (see, for example, (3.58)) 


Powel Seede.. Supa] = 2s: (3.89) 


However, in the present case we note that 


(Js)' = Js, (Ja)! = —Jg, (3.90) 
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This change in the Hermiticity of the generators is reflected in the fact 
that the corresponding Lie group, in this case, corresponds to the non- 
compact group SO(1, 2) for which the invariant length is given by x3— 
(x? + 23). The associated Lie algebra under consideration is so(1, 2) 
(which is isomorphic to su(1,1)). In this case, the quadratic Casimir 
operator (which commutes with all the generators as in (3.53)) can 
be calculated to have the form 


h=P=R+ ; Ge mee ae ee ; (ata blo) = (3.91) 
which leads to an infinite dimensional unitary representation of the 
Lie group SO(1, 2). 

Before concluding this section, we note that the validity of (3.64) 
would depend on the definitions of X°; in (3.48) (and the corre- 
sponding A%s) and TH1#2""#” in (3.63). This necessarily brings in the 
symmetry properties associated with the tensor T7#142"""", The con- 
struction, in fact, has a much more general validity independent of 
the identifications in (3.48) and (3.63). Let us consider a more general 
mixed tensor (state) of the form T/i/.?. 2” which may or may not have 
any symmetry such as total symmetry or anti-symmetry associated 
with its indices. If X% and X operate on this tensor yielding 


n 


be F seh Oras 
XT Ln =) De Toasty, 1QUj41°"Un 


V{V2"-Vm 
j=l 
m 
i ) OL PHA M2" bn 
Oo ace aoe 
k=1 
He 2" [1 2s 
BG Us ee _ (n = Lae ate ae (3.92) 


then, it can be verified easily that the operators X “3 satisfy the u(V) 
algebra (3.50) without assuming any identification such as (3.48). 
The verification proceeds as in the calculation of the eigenvalues of 
the second order Casimir invariant Jy described in the next section. 


3.3 Structure constants and the Killing form 


Let L denote a Lie algebra. Since L defines a vector space (see 
(L1)), we can find a linearly independent basis e1,e2,--- ,eq in this 
space where d = dim L. Here we assume L to be finite dimensional, 
namely, we are considering a finite dimensional Lie algebra L. We 
note that even though L is finite dimensional, the associated Lie 
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group is infinite dimensional. For example, let us consider the Lie 
algebra L = so(3) which is three dimensional (there are only three 
elements or generators in the algebra), but the Lie group G = SO(3) 
is clearly infinite dimensional (the number of elements of rotation 
corresponding to different rotation parameters is infinite). This is 
one of the reasons why Lie algebras are easier to deal with than Lie 
groups. 

Returning to our discussion, let us note that if e),e9,--- ,eg 
define a basis of L, then the bilinear product [e,,e,] € L where 
pi, v = 1,2,--- ,d (see (3.2)) so that the product can be expanded in 
the basis e) as 


[ens ev] “oa ve (3.93) 


for some constants Ce which satisfy the conditions 


A Xr 
ChHaCy, (3.94) 
C108) + 09,02, + 02,08, = 0, (3.95) 


where summation over the repeated index a is understood in (3.95). 
Here p,v,A,a, 0 = 1,2,---,d. The anti-symmetry in (3.94) is obvi- 
ous from the definition (3.2) since 


leu, ev] = —[ev, eu]. (3.96) 


The nonlinear relation in (3.95) can be seen to follow from the Jacobi 
identity in (3.4) in the following way. Using (3.93), we note that 


d 
[leu ev], ey] = Ss" Citas o| 
a=1 
d 
= 5 Cles.es (by (3.3) 
a=1 
d d 
=Son + hes (by (3.93) 


SC OLR (3.97) 


a,pb= 1 
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so that the Jacobi identity (3.4) leads to 


0= le, ev], ea] + [[ev, eal, ex] + [lea, eu), ev] 
d 
= > (cacé, + 03,08, + 08,08, es, (3.98) 
a,B=1 


which proves (3.95) since the basis is assumed to be linearly indepen- 
dent. 

Conversely, if we have a set of constants OF, satisfying the con- 
ditions (3.94) and (3.95) and if we define the bilinear product as in 
(3.93), then we can readily show that it satisfies the Jacobi identity 
(3.4) and completely determines the Lie algebra. The constants Cr, 
are known as the structure constants of the Lie algebra. 

For any Lie algebra ZL, we can find at least one nontrivial d x 
d matrix representation (corresponding to the dimension of L) as 
follows. Let e€1, €2,--- ,eq be a basis of L with the structure constants 
Cy. Let us introduce the d x d matrix paa(e,) by 


(Pad(€n)), = -Cr,, 
d 


d 
(Pad(©)) jy, = Pal, Exer) = ¥- € (Pad(€r)) pv + (3.99) 
A=1 


A=1 


d 
where « = )> €)e) with constant coefficients €) is an arbitrary ele- 
r=1 
ment of L. We note that the second relation in (3.99) is due to the 
linearity of paa(x). Using the standard matrix multiplication, we now 


obtain 


[Paa(€p); Pad(ev lag = (Pad(€u) Paa(ev) 4 Pad(€v) Pad (ep) avg 


d 
=S> ((Padle))ax (Paa(ev))ag — Paaler))ar (Paden) 9) 
A= 


d d 
=> (-cA.c8 - CU.C%,) =) CC. 
A= 
d 


d 
= ON (=08 |S Oates (3.100) 


A=1 A=1 
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where we have used (3.94) and (3.95) in the intermediate steps. 
Therefore, we see that the d x d matrices paa(e,) satisfy the rela- 
tion 


[Pad(€), Pad(ev)| =e Pad (EX) 


= Paal ome = Pad( (eu; ev), (3.101) 


where we have used (3.93) as well as (3.99). Consequently, it follows 
that Paa(x) for 


d 
r=) bey, (3.102) 
p=1 


where €,, denote real or complex constants, provides a d x d matrix 
representation of IL known as the adjoint representation of the Lie 
algebra. 

The adjoint representation can actually be derived more eas- 
ily in the following manner. Let us recall the relation between linear 
operators and matrix representations in a vector space (see, for exam- 
ple, (2.17)-(2.22)). We can define a linear operator called the adjoint 
operator adx with respect to any element of the Lie algebra x € DL 
acting on vectors in the vector space L by 


(adx)y = [x,y]. (3.103) 


Equivalently, we can also define the linear operator A, = ade, u = 
1,2,--- ,d which corresponds to the adjoint operator with respect to 
a set of basis vectors in this space and we see, from (3.103), that it 
satisfies (see also (3.97)) 


Ane = legsen| a (3.104) 


Using (3.103) we note that we can write 


lly, 21,2] = —[2, ly, 2]] = —[a, (ady) 2] = — (ada) (ady) z 
[2,2], u] = ly, la, 2]] = [y, (ada) 2] = (ady) (adx)z. (3.105) 
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It follows now that the Jacobi identity (3.4) can be written as 


[x,y], 2] + [ly, 2], 2] + (lz, 2], y] = 0, 
or, ((ad|x,y]) — (ada) (ady) + (ady) (adx))z=0, (3.106) 


which must hold for any three arbitrary elements x,y,z € LD and, 
therefore, determines 


(ad [x, y]) = (ad x) (ady) — (ady) (ad) 
= |(ad x), (ady)). (3.107) 


Consequently, we can identify 


(adey) = pad(eu), (ad) = paa(®), (3.108) 


as a representation, the adjoint representation, of the Lie algebra L. 
We also note here that the adjoint representation of the so(N) Lie 
algebra is isomorphic to that obtained from any second rank anti- 
symmetric tensor T),, = —T,, operator which is left as an exercise. 

Let us next introduce a symmetric inner product for any two 
elements x,y € L by 


(rly) = Tr ((adx) (ady)) = (yla), (3.109) 


which is clearly symmetric because of the cyclicity of trace (“Tr” 
denotes matrix trace). Furthermore, we note that 


([z, y]|z) = Tr ((ad[zx, y]) (ad z)) = Tr (([(ad x), (ad y)]) (ad z)) 
= Tr ((ad x)(ady)(ad z)) — Tr ((ad y) (ad x) (ad z)) 
= Tr ((adx)(ady)(ad z)) — Tr ((adx) (ad z)(ady)) 
= Tr ((ad 
[ 


x) ([ad y, ad 2])) 
= (2|ly, 2]); 


and this is easily seen to be completely anti-symmetric under the 
exchange of any pair of elements x,y,z € L. Here we have used the 
cyclicity of trace as well as (3.107) in the intermediate steps. 


y, 2] (3.110) 


For a set of basis vectors e1, €2,:-- ,e€q of the Lie algebra, let us 
define the symmetric tensor 
Juv = (euler) = Tr ((ade,) (adev)) = guy, (3.111) 


which is known as the Killing form (named after the German math- 
ematician Wilhelm Killing) or the metric tensor of the Lie algebra. 
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Using the identification in (3.99) (see also (3.108)), we can write ex- 
plicitly 


d 
Vins S- (ad ey) 3 (ad ev) g = s Cl. oe (3.112) 
a,B=1 a,pb= 1 


Similarly, following (3.110) we can introduce a completely anti-symmetric 
third rank tensor as 


fur = (ley, ev|lea) = (eyl lev, ea])- (3.118) 
Using (3.93) as well as (3.111) we see from the definition (3.113) that 


d 


d 
Funk = (leu ev |Ex) = uv €al€r) = »_ Ch gar, 
LU 
a=1 


d 


d 
fr = (el ev, €]) =e sh (eulea) = S- CHIpor (3.114) 


so that we can write this completely anti-symmetric tensor explicitly 
as 


d d d 
fwd = S- Ch Gar = S- Cor Go = ys C*XJav; (3.115) 
a=1 a=1 a=1 


where we have used the symmetry of the metric tensor and the last 
relation comes from the total anti-symmetry (under the interchange 
of any pair of indices) in the definition in (3.113) as well as the anti- 
symmetry of the structure constants Ch (see (3.94)). Such a com- 
pletely anti-symmetric third rank tensor f,,,, is often used in physics 
in connection with gauge theories, as we will see in chapter 8. 

As we will see in the next section g,,, will be non-degenerate for 
the class of Lie algebras known as semi-simple Lie algebras. Namely, 
for such Lie algebras, there exists an inverse metric tensor g'” satis- 


fying 
S- guy = 6, by =1,2,--- ,d. (3.116) 


In this case, we can define the second order (quadratic) Casimir in- 
variant as 


d 
b= > g”o(en)o(er), (3.117) 


bv=1 
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for any representation p(e,) of L, not necessarily the adjoint repre- 
sentation Paa(e,). (Namely, the metric tensor is clearly defined from 
the basis matrices in the adjoint representation, but once it is de- 
fined, it becomes a fixed tensor in the vector space and can be used 
for raising and lowering indices in any representation.) It now follows 
that 


2, plex] = D> 9” [o(en)e(er), pler)] 


d 
= SF 9” (o(en)[o(er), plea] + [oley), o(er)lolev)) 


bv=1 


d 
= > Sg” (C2 p(en)p(ea) + CH p(ea)p(ev)) 


pjv=la=1 


d 
= > (gMC%, +9°°C%,) plenoler), (8-118) 


where we have interchanged a © vy in the first term and a © pw 
in the second term in the last line (together with the symmetry of 
the inverse metric). We can simplify the right hand side by using the 
definition (3.116) as well as the anti-symmetry (3.94) of the structure 
constants as follows, 


d d d 
de (9 Coy + 9°C2y) = D9 Con + DI Cho? 
a=1 B=1 


a=1 
d 


= SO gt (Cos986 + GaoC§y) 9”? 
a,B,co=1 
( 


d 


= Ss" g!® (C8908 — Cg9ea) 9”? 
a,B,o=1 


=0, (3.119) 
where we have used (3.115) in the last step. This shows that 
[T2, p(e)] = 9, (3.120) 


namely, the quadratic Casimir operator commutes with every ba- 
sis element and, therefore, with any arbitrary element p(x), € L 
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of the Lie algebra. (clearly, the Casimir operator is defined upto a 
multiplicative (normalization) constant.) If p(x) is irreducible, then 
Schur’s Lemma (2.103) implies that Jj is proportional to the n x n 
unit matrix EF, (n denotes the dimensionality of the matrix represen- 
tation), namely, 


Tp = CoEn. (3.121) 


The constant C2 corresponds to the eigenvalue of the Casimir invari- 
ant J2 for the particular representation under consideration. In the 
case of su(2), we have 

1. 3 


Co =j(F +1), DS a tiger 3 (3.122) 
if we choose a suitable normalization. 

Let us consider the examples of u(V) or su(V) Lie algebras and 
denote the matrix representation p(X‘,) as X‘,, for simplicity (see 


(3.50)). Then, for u(NV) the second order Casimir can be written as 
Ip = X",X", (3.123) 


while for su(V) (see (3.54) and (3.56)) it is given by 


1 1 
B= AMA’, = XY XY aa Xe Sy oe (3.124) 
where we have used the definitions in (3.52) and (3.54) (and repeated 
indices are summed). If we operate J on the completely symmetric 
tensor (state) in (3.64), namely, 


THAb2 on Gat ot al, |0), (3.125) 


and calculate using (3.65), we obtain (recall that a, 8 = 1,2,---,N 
for u(N) or su(NV)) 


[THE Hm DD Cet aia 


n 
— x5 oOo EAB e ARS tire 
j=l 
n 


y x" Tab Hj 1B Bj +17 Hn 


J 


BR 


n 
= 5 OP a ae aia ea a 


j=l kAj 
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ais SPO EEN ey 
= eae ara Sen 
JAK 
+ Nn Thi be bn 
= (n(n — 1) + Nn) THA Hn 
=n(N +n —1)THHe He, (3.126) 
so that the eigenvalues of Iz (the quadratic Casimir for u(N)) for 
such a representation is given by n(N + n-— 1). Here we have used 


the symmetric nature of T¥#!#2""#" in the intermediate step. 
Similarly, it follows that 


n 
te a te Mi se [bj—1 pb tie 
XT in — KO Pian » On) THIH2  Hi-LOH +1 hn 


j= 


n 
= S> Tea Hn pT HHO tin (3.127) 
j=l 


which leads to 


be 1 us 
ree Mn — (1 _ 7%”) THAB2 bn 


n2 
= (now +n—1)- 7) T HAH 2" Mn 


_(N- pny +n) PHAM bon (3.128) 


Therefore, the eigenvalues of the quadratic Casimir operator for su(V) 
for such a representation is given by Wes) For su(2), this 
leads to 


1 
BTM Hm = n(n + 2)Teeein = 255 + LTMMHn, (3.129) 
for j = 5. This reproduces the well known result that 
pat an a (3.130) 
= 5 ANA, ; 


1 4.3 


has eigenvalues j(j + 1) with j = 0,5,1,5,---. 
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3.4 Simple and semi-simple Lie algebras 


Let L be a Lie algebra so that we have 


ie CD. (3.131) 


However, L can also possess its sub-Lie algebra Lo C L such that 


(Des Pae he. (3.132) 


Moreover, it can also happen that the subalgebra satisfies a stronger 
relation 


[Lo, L] € Lo, (3.133) 


and in this case, we say that the subalgebra Lo is an ideal of L. When 
the solution to (3.133) excludes the rather (obvious) trivial cases of 
Lo = L and Lo = 0 (null), we say that Lo defines a proper ideal of 
the Lie algebra L. A Lie algebra L which is not one-dimensional and 
does not contain any proper ideal is known as a simple Lie algebra (or 
simply a “simple” algebra). On the other hand, if L contains proper 
ideals, but none of the proper ideals is Abelian, then it is known as 
a semi-simple Lie algebra. We can further classify Lie algebras into 
categories such as solvable, nilpotent, etc. However, we will not go 
into details of these since we will not use them in any essential way 
in our discussions. As noted earlier, we will consider L to denote 
a complex Lie algebra. Let us next note some important theorems 
within the context of Lie algebras without going into proofs which 
would be beyond the scope of such a book. 


Theorem. Any semi-simple Lie algebra is a direct sum of simple Lie 
algebras, i.e. a semi-simple Lie algebra L can be written as 


L=1,@120:::@OlLm, (3.134) 


where L;, i =1,2,--- ,m denote simple Lie algebras with 


[Li,Lj]}=0, for iF}. (3.135) 
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Theorem. (Cartan’s criterion) 
A necessary and sufficient condition for a complex Lie algebra 
L to be semi-simple is that its killing form 


(x|y) = Tr (adx) (ady), (3.136) 


is non-degenerate. In other words, if e1,€2,°-- ,eq denote a basis of 
L, then (see also (3.111) as well as (3.116) ) 


Juv = Tr (ade,) (ade,) , (3.137) 
d 
has an inverse gt” satisfying >> Gung = 67, for u,v =1,2,--- ,d. 
d=1 

Theorem. Any finite dimensional matrix representation of a semi-simple 
Lie algebra is fully reducible. 
> Example (u(V) algebra). As we have seen in section 3.2.3, we can identify the 
basis elements of u(NV) as 

Delay. (3.138) 


where Gy,, al, pu = 1,2,--- , N denote respectively the annihilation and creation 
operators of a N-dimensional harmonic oscillator and satisfy the u(NV) algebra 
(see (3.50)) 


[X",, Xe] = OFX", — OB XG, (3.139) 
which defines a non-Abelian algebra. As we have seen in (3.52), in this case, we 
can identify the (linear) invariant operator (linear Casimir operator) 

ne ee ao (3.140) 
which commutes with all the basis elements (generators) of L (and, therefore, with 
itself) 

[, xX] = [X, xX) =0, [h,h] = [x, X] =0. (3.141) 
As a result, the set of elements Lo = {Ali} for any complex number A defines a 
proper ideal of L = u(N), since 

[Lo, L] =0C Lo. (3.142) 


Therefore, u(N) cannot be a simple algebra. But, we also note that u(NV) cannot 
be a semi-simple algebra because Lo is an Abelian ideal since 
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[Lo, Lo] = 0. (3.143) 


This shows that the u(NV) algebra is neither simple nor semi-simple. In fact, let us 
note that, since su(V) is known to be a simple Lie algebra, if we write (of course, 
dimensionally this is permissible) 


u(N) = su(N) @ {Ah}, (3.144) 


where {AJ} defines a one-dimensional Abelian algebra, then this would have the 
general structure of a direct sum of a semi-simple and an Abelian Lie algebras. 
Such a Lie algebra is known to be a reductive Lie algebra and any (real) reductive 
Lie algebra is known to be derived from a (real) compact Lie algebra and vice 
versa in some sense. This is true for u(N) and, therefore, u(N) is a reductive 
Lie algebra. It is worth pointing out here that the Poincaré Lie algebra is neither 
simple nor semi-simple as well, but is not reductive. 

< 


> Example (so(2,1) Lie algebra). We have already studied the angular momen- 
tum algebra 


[Ji, Jj] = teign de, (3.145) 


corresponding to the equivalent Lie algebras so(3) ~ su(2) (see (3.31) and (3.60)) 
which are equivalent. More explicitly, we can write the algebra in (3.145) as 


[Ji, Jo] =tJ3, [Jo, Js] =2s1, [J3, Ji] = tJe. (3.146) 


As we have discussed earlier, the groups SO(3) and SU(2) are compact groups 
describing rotations. Therefore, the finite dimensional representations of these 
Lie algebras are unitary and describe the angular momentum states which are 
essential in studying rotation invariant quantum mechanical systems. 

There are two other equivalent Lie algebras, so(2,1) ~ su(1,1), which are 
also very useful in studying some quantum mechanical systems and we will discuss 
here the unitary representations of these algebras. (This can be thought of as the 
Lorentz algebra in three dimensions.) Like angular momentum, this algebra also 
consists of three generators Jj,i = 1, 2,3 satisfying the commutation relations 


(Ji, Jo] = —tJs, [Jo, Js] =id1, [Js, Ji] = ido, (3.147) 


which is similar to (3.146) except for the sign in the first relation. It is clear that 
the algebra (3.147) can be mapped to the angular momentum algebra with the 
redefinition 


J => iJSi, Jo => tJ. (3.148) 


However, under this redefinition, Ji:,J2 cease to be Hermitian. Of course, the 
finite dimensional representations of so(2,1) ~ su(1,1) can be obtained in this 
manner from those of the angular momentum algebra, but these representations 
are not unitary. This is simply a reflection of the fact that the groups SO(2, 1) 
and SU(1,1) are non-compact (like the Lorentz group) for which there is no finite 
dimensional unitary representation. (The generator J3 can still be thought of as 
generating compact rotations around the z-axis, but Ji, J2 generate non-compact 
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transformations along the «, y-axes.) In quantum mechanics, on the other hand, 
we are interested in unitary representations. For a non-compact symmetry these 
are necessarily infinite dimensional. 

To construct the infinite dimensional unitary representations, let us continue 
with (3.147) where all the generators are Hermitian 


(J:)' = Ji, i=1,2,3. (3.149) 


The quadratic Casimir invariant of the algebra (3.147) can be checked to corre- 
spond to 


In = (hi)? + (Je)? — (Js)?, [Io, iJ =0, 2 =1,2,3. (3.150) 


This can be compared with the Casimir invariant of the angular momentum alge- 
bra given by Iz = J? = (Ji)? + (Je)? + (Js)?. We can, of course, diagonalize only 
one of the generators of the algebra along with Jz and we choose to diagonalize 
the compact generator J3. In the usual manner we can conclude that the eigen- 
values of J3 take only integer or half integer values (allowing for double valued 
representations), namely, 

1 3 


J3: m=0,+1,+2,---, or, m=+t5,t5,°°- (3.151) 


As in the study of angular momentum algebra, let us define the operators 


Jz =Si tik, (Jz)! = Jz, (3.152) 


which can be checked (using (3.147)) to satisfy the commutation relations 


[J+i, Ja] = [Jr + tJ, J3] = —tJo + i(tJ1) 
=> (Ai as iJ2) = ary pm (3.153) 


Therefore, Ji correspond to the raising and lowering operators which raise/lower 

the eigenvalues of Jz; by one unit. Namely, acting on an eigenstate of J3 with 

eigenvalue m, they take us to another eigenstate of J3 with eigenvalue (m+ 1). 
We note that 


Ja. Jq = (Si £id2)(Ii Fide) = (hh)? + (Je)? F if, Je] 
= (Hi)? +(e)’ $ i(-tJs) = (a)? + Ja)” = s)”) + (Js)? ¥ J 
= Ip + J3(J3 — 1). (3.154) 


Equivalently, we can write 


Side = BAD. (3.155) 


From (3.152) we see that Jz Jz = JLJz is a positive semi-definite operator 
and, therefore, we conclude from (3.155) that the quadratic Casimir operator 
for so(2,1) is not positive semi-definite, unlike in the case of the angular momen- 
tum algebra. The eigenvalues of Jz, in this case, can be positive or negative or 
zero. Let us denote the simultaneous eigenstates of [2 and J3 as satisfying 


J3|j,m) = mlj,m), m =0,+1,+2,--- ’ or, TIS pater a ’ 


I2|j,m) = —5( + 1)|3,m). (3.156) 
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We point out here that in some literature the eigenvalues of Jz are denoted by 
k(1 — k) with the identification k = —j. Furthermore, since the second equation 
is invariant under 7 © —(j +1), without any loss of generality, we will restrict 
ourselves to 7 < 0. Depending on the values of (j,m), the unitary representations 
of the algebra are classified into three categories which we discuss below. 


(t) Discrete/analytic series: Here we assume that the infinite dimensional 
representations are bounded from below or from above. In this case, taking the 
expectation value of (3.154) in the state |j,m) and recognizing that the operator 
JsiJz is positive semi-definite, we obtain 


(i,m | Je Jel), m) = Gh m| Ip + J3(J3 =F 19, m) 


= —j(j +1) + m(m F 1) 2 0, (3.157) 


where we have assumed that the eigenstates are normalized. For a fixed value of 
j, let us assume that there exists a state |j, min) such that 


J_|j,mmin) = 0. (3.158) 
In this state, (3.157) leads to 
— GG +1) + mmin(Mmin — 1) = 0, 
Or, Mmin = —j > 0. (3.159) 


Such a representation is bounded from below and is denoted by D5. It is labelled 
by 


1 3 
- —=,-2,-++. 3.161 
: (3.161) 
It is clear that given a value of j, the representation is uniquely determined and 
is infinite dimensional (since m takes an infinite number of values). We note here 


that for 7 = —5, the eigenvalue of the quadratic Casimir operator is positive 
while for 7 = —1 it vanishes. For all other values, it is negative. The trivial 
one dimensional representation corresponding to 7 = —1 will not be considered 
further. 


In a similar manner, if there is a state |j, Mmax) such that 
J+|j,Mmax) = 0, (3.162) 
then (3.157) in this state determines 
— (fj +1) + mmax(™Mmax + 1) = 0, 
Or, Mmax =j <0. (3.163) 


Such a representation is bounded from above and is denoted by D; . It is labelled 
by 
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Principal series: Besides the discrete representations discussed above, there are 
also continuous unitary representations of the Lie algebra and the principal series 
is one of them. They are known as continuous representations because j takes a 
continuum of values. Furthermore, these are representations where there does not 
exist any state satisfying 


J_|j,m) =0, or, J+|g,m) =0. (3.165) 
Correspondingly, the value of m is not restricted either from above or below. 
In the continuous representations, we assume that j(j7 +1) < 0 (so that the 


eigenvalue of the Casimir operator Iz is positive). In the case of the principal 
series we allow the quantum number 7 to be complex. Therefore, writing 


j=ati, (3.166) 
where a, X are real, we obtain 


jG +1) = (a +idA)(a+14+7)) = real, 


or, (a(a+1)—- 7) + idA(2a + 1) = real, 


or, Q= -;. (3.167) 
Furthermore, using this value of a in (3.166), we can check that 
; Vit engl 
jG +1) = (—= + iA)(= + iA) < 0, 
2 2 
1 \2 
or, ~+A°>0. (3.168) 


4 


In this case, the eigenvalues of the Casimir operator takes values + <-jij+l)< 
oo (which is known as a non-exceptional interval). In this case, we note from 
(3.154) that 


. , shi 1 
(Gm) JaJgli.m) =—Jjj +1) +m(m #1) = +(m# 5)? >0, (3.169) 
which holds for any finite value of A. As a result, there is no state in the repre- 


sentation for which 
J_|j, m) =0, or, J+J, m) = 0, (3.170) 
so that the m quantum number in these representations is unbounded from below 
as well as from above. Further investigation shows that, in this case, 
1 3 


res eee 171 
m Boe (3.171) 


Complementary series: The complementary series (sometimes also called the 
supplementtary series) also describes a continuous unitary representation where 
m is not bounded either from below or from above. Here we assume, as in the case 
of the principal series, that 7(j + 1) < 0 (so that the eigenvalues of the quadratic 
Casimir operator is positive). However, we require j to be real unlike the earlier 
case. Thus, we have 


iG +1) <0, 
apts l\o 
or, (+5) < (5), 
Tal 
eee eee 172 
Of RF SS (3.172) 
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This determines (recall that j < 0) 
at ope: (3.173) 


Global properties of the group further restrict this finite interval to 


1 
-l<j< a5) (3.174) 
Equivalently, we can identify 
: 1 
nae A, (3.175) 


where is real and satisfies 


1 
0<A< 5° (3.176) 
and j can take any continuous value in the finite interval (3.174). 
We note that in this interval —j(j + 1) = + — ? so that using (3.176) we 
have 0 < —j(j+1) < } (which is known as an exceptional interval). From (3.154) 
we then note that 


(j,m| Ji J<|j,m) = —j(§ +1) +m(m F 1) = (mF =)P?-AN >0. (3.177) 


As a result, there is no state in the representation for which 
J_|j,m) =0, or, J4|j,m) =0, (3.178) 


so that the m quantum number is not bounded from below or from above. Further 
investigaten shows that in this case 


m =0,+1,+2,---. (3.179) 


The Lie algebra so(2,1) ~ su(1,1) is useful in studying scattering states of 
potentials such as the Péschl-Teller potential. It also manifests in the study of 
two dimensional (anti) de Sitter space. 

< 


> Example (so(4) Lie algebra). The so(N) Lie algebras are known to be simple 
for N = 3 and N > 5. (so(2) is an Abelian algebra.) However, so(4) is not 
simple, rather it is a semi-simple Lie algebra which can be seen as follows. Let us 
recall the discussion in section 3.2.2 and note that the basis elements Jv = —Jvy 
with (in section 3.2.2 we denoted the basis elements as the angular momentum 
operators Lj.) 


[Juv Jap] =1 (Ouadve + dvpI pe FZ Ou pIve ae Oved us) s (3.180) 


where pt, Vv,a, 0 = 1,2,3,4. If we now identify 


1 1 
j= 3 (Jog t+Jia), Jo= 3 (J31 + Joa), J3 = = (Jia + Jza), 


(Ji2 — Jza), (3.181) 


Nl wNle 


1 1 
Kki= 3 (Jas — Jia), ko = 5 (Js — Joa), Ka= 


94 3 LIE ALGEBRAS 


or equivalently 
1/1 1/1 
J, = 3 (Jed + Jia) eS 3 (Feud — Jj ) ‘ (3.182) 


where i, j,k = 1, 2,3, the algebra (3.180) can be written as 


[Ji, Jj] = teign dn, (3.183) 
[Ki, Kj] = ieijn Ke, (3.184) 
[Ji, Kj] = 0. (3.185) 


We recognize from (3.183) and (3.184) that J; and K; individually satisfy the 
so(3) algebra (see (3.31) as well as (3.60) and note that the two algebras so(3) 
and su(2) are equivalent). Furthermore, from (3.185) we note that J; and K; 
commute so that we can write 


so(4) = so(3) © so(3), (3.186) 


which shows that the so(4) Lie algebra is a direct sum of two so(3) Lie algebras 
which are simple Lie algebras. This proves that so(4) is a semi-simple Lie algebra. 

We note here that given the operators J,,, we can define their conjugates 
(in physics also known as dual) as 


Me 1 
Juv = 5 Env rp Xp, (3.187) 
where u,v, A, p = 1,2,3,4 and €,,,,, is the completely anti-symmetric Levi-Civita 
tensor in four dimensions with €1234 = 1. It is clear from this definition as well as 
from (3.181) that we can write 


1 F 1 5 1 * 
Ji = 3 (J23 + “ Joa), J2 2 (Jai + “Ja1), J3 = 3 (Ji2 + “Ji2), 


1 ‘ 1 5 1 id 
ky= 3 (J23 — Jzi), Ka 5 (Js1 — Js), Kg = 3 (diz — Siz); 


(3.188) 


which shows that J; and K; with 7 = 1,2,3 correspond to the self-conjugate 
and anti-self-conjugate components of J,,. Therefore, finding a representation 
of J, in terms of the representations of J; and K; would correspond to finding 
self-conjugate and anti-self-conjugate representations of so(4). 

As a simple application of the so(4) Lie algebra, let us determine the energy 
levels of a Hydrogenic atom described by the Hamiltonian 


[: oe eee (3.189) 


where Z denotes the nuclear charge, the reduced mass of the system and r = |r| 
corresponds to the length of the reduced coordinate vector (Z = 1 corresponds to 
the Hydrogen atom). Since three dimensional rotations define a symmetry of this 
Hamiltonian, the three angular momentum operators L;,i = 1, 2,3 defined by 
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L=xxp, Lj = €ijrXjpr, (3.190) 


commute with the Hamiltonian 


[Li,H]=0, i=1,2,3, (3.191) 


As we have seen in (3.31), the angular momentum operators L; satisfy the so(3) 
Lie algebra (we restore here the factor of h that arises in quantum mechanics from 
the identification p = —ihV) 


The Hamiltonian in (3.189) has a second (hidden) so(3) symmetry con- 
ventionally known as the dynamical symmetry or the accidental symmetry of the 
system. This can be seen by noting that if we define a set of three operators M; 
through the relation 


_“—r L-M=0, (3.193) 


then one can verify using the canonical commutation relations as well as (3.191) 
that 


[M;, H] = 0, (3.194) 


and the three operators M; generate transformations which leave the Hamiltonian 
(3.189) invariant. Furthermore, using (3.192) (as well as the canonical commuta- 
tion relations) it can also be checked that 


[M:, Mj] = — case Le, 


We note here for later use that we can write 
_ 2H 
Lb 


M2 


(L? +h?) +e’, (3.196) 


where the factor fh? arises because of non-commutativity of quantum operators. 
If we work in the subspace of the Hilbert space corresponding to a particular 
energy value EL, then the first relation in (3.195) has the form 


2hE 


[M;, Mj] = — ijk Le, (3.197) 


and a simple rescaling of the operators M; by 


Mo (-4) 2 ML, (3.198) 
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allows us to write the relations (3.192) and (3.195) as 


[Li, Lj] = theijrLe, 
[M;, Mj] = theijn Lr, 


This is a set of coupled commutation relations which defines an algebra. However, 
to understand its meaning, let us change the basis and define (compare with 
(3.181)) 


= 5(Ls+M), Kia 5 (Li- Mi). (3.200) 


Using (3.199), the new operators can be checked to satisfy the algebra 


[Ji, J5] = theijnIk, 
[Ki, Ky] = theijn Kr, 
[Ji, Kj] = 0, (3.201) 


which coincides with (3.185). Therefore, we see that there is a larger symmetry 


so(4) = so(3) © so(3), (3.202) 


operative in the Hamiltonian for the Hydrogenic atom. 

To determine the energy spectrum of the system, we note that each of the 
two so(3) algebras has an associated quadratic Casimir operator which we denote 
by J? and K? respectively. The representations of the two angular momentum 
algebras in (3.201) would be labelled by the eigenvalues of these operators which 
have the forms 


Js i(j+1), K? :Wk(k+1), (3.203) 


where j,k = 0, $ 1,---. Alternatively, we can define 


CaP 4K =5(+M), 


C’=J?-K?=L-M=0. (3.204) 


The second of the relations in (3.204) implies that for any allowed representation 
we must have 7 = k, which in turn implies that in any representation the quadratic 
Casimir C will have eigenvalues 2h?j(j-+1). It follows now that we can write (recall 
(3.196), (3.198) as well as the fact that we are in a subspace of the Hamiltonian 
with energy value FE) 
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C= 5 (2 +m) 
_ 1/2 2,2  p(Ze?)* 
5 (x L h aE 
Ze*)? 1 
(-“72) 5) 1 = 2ny(j + 01, (3.205) 
so that we determine 
y(Ze2)? y(Ze2)? 
Ey = - = ; 2 
2h? (27 + 1)? 2h2n? es) 
where we have identified 
n=2j74+1=1,2,3,---. (3.207) 


We recognize (3.207) as the correct energy levels for the Hydrogenic atom. 
However, the symmetry analysis can even determine the degeneracy associated 
with each of these levels. We note from (3.200) that we can invert these relations 
to obtain 


L=J+K, (3.208) 


and from the composition (addition) of angular momenta, we recognize that the 
eigenvalues of L? denoted by h?£(€+ 1) can take values 


=n-—1,n—2,---,1,0. (3.209) 


Namely, the energy level (3.207) would have a n-fold degeneracy associated with 
the quantum number of the orbital angular momentum ¢ which is a consequence 
of the larger so(4) symmetry of the system. (This is in addition to the degener- 
acy associated with the azimuthal quantum number m which results from three 
dimensional rotational symmetry.) 

< 


3.5 Universal enveloping Lie algebra 


We end this discussion of Lie algebras with only a brief comment 
on the enveloping algebras. Let L denote a Lie algebra. Then, by 
definition (see (3.2) and discussion there) if x,y € L, then [x,y] € L 
is defined, but xy is not well defined in LZ in general. However, we 
can formally introduce an abstract associative product xy, xyz,--- 
for any x,y,z € L satisfying the Jacobi identity. The resulting asso- 
ciative algebra is known as the universal enveloping algebra of the Lie 
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algebra L. It is infinite dimensional even if L is finite dimensional. In 
this case, the original Lie product [x, y] is identified with the commu- 
tator of L, namely, |x, y] = xy — yx. For example, let us consider the 
Lie algebra of so(3). The generators in the coordinate representation 
have the forms L; = —ie;;x (wish 7 regs) ,t,j,% = 1,2,3. In this 
case we note that the product of two generators L;L; is a quadratic 
differential operator not in the Lie algebra of so(3) (the commuta- 
tor, however, is linear and belongs to the Lie algebra). However, if 
we formally, introduce this operator product, the commutator can be 
identified with the product in so(3). The operator products define an 
associative algebra in the sense that L1(L2L3) = (11 L2)L3 = Li LoL3 
which is infinite dimensional and defines the universal enveloping al- 
gebra of so(3). Moreover, we can also introduce the concept of a 
Casimir invariant in this universal enveloping algebra. For exam- 
ple, for the u(NV) Lie algebra, we can define the quadratic Casimir 
invariant to be the product 


= a (3.210) 
without appealing to the matrix product of a matrix representation 


of the Lie algebra L. 
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CHAPTER 4 
Relationship between Lie algebras and 
Lie groups 


In the last two chapters we discussed the concepts of Lie groups 
and Lie algebras. In this chapter we will explore systematically the 
connection between the two. We start in the next section with the 
concept of an infinitesimal group and its generators and see how it 
helps derive the Lie algebra from a Lie group. 


4.1 Infinitesimal group and the Lie algebra 


Let G be a Lie group which we assume to be G = GL(N), U(N), SU(N) 
for simplicity. Let A denote its defining (N x N dimensional) ma- 
trix representation (sometimes also called the fundamental represen- 
tation) corresponding to the group element a € G. (As we have 
discussed in chapter 2, a representation of the group can be obtained 
either as matrices or as linear operators acting on a Hilbert space. 
However, here we will restrict ourselves only to matrices.) Let us next 
consider a group element a € G which differs only infinitesimally from 
the identity element so that, in this case, we can write 


a+>A=Ey +8 +O(e’), (4.1) 


for a real, infinitesimal constant parameter ¢. As matrices (4.1) takes 
the explicit form (u,v = 1,2,--- , N denote the matrix indices in the 
defining representation) 


AH, = OF + €Bt, + O(e*) 


1+¢B1, Bt, -:- eBly 
eo? 1 + 8? sate Ee" 
=| “° ie 1 +0(e), (4.2) 
By By + 14+6B% 


99 
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where 3';, parameterize the group. Since « is infinitesimal, it follows 
that 


det A= 1+ O(e) £0, (4.3) 
so that A~! exists as it should and has the form 


(A“)") = 68 — eft, + OC”) 


_— Bt, —€B, see Bly 
—¢82 e825. <8 _ 2 
z 1 ee 2 . N | 1 oe). (4.4) 
a eet. 


If G is the real GL(N) group, 8 would represent a real matrix 
(the matrix elements 6, would be real), while for the case of the 
group U(N), the additional condition (1.49) (with the identification 
A", = Gp, note also that (AT), = AY) 


N 
S- (4x,) AX, = by + O(2), (4.5) 
ASL 


requires that 

(ee = SOs (4.6) 
for complex constants 8",. For the group SU(N) we further need to 
impose the condition (1.56) 

det A= 1, 

or, det (1 +e8+ O(e’)) =1, 

or, exp Tr ln (1 +¢B+ O(e’)) =1, 

or, exp Tr (e8 + O(e?)) =1, 

or, exp(eTrB) =1+4 O(e’), 


N 
or, ire = Ss" By =0. (4.7) 


A=1 
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Therefore, we can summarize that the matrix elements 6", satisfy 


Be, = real and arbitrary, ifG=realGL(N), (4.8) 
(Bi) = 6 if G= U(N), (4.9) 
N 
S| 6\, =0 (in addition to (4.9)), if G= SU(N). (4.10) 
A=1 
We note here that (4.10) holds even for G = SL(N) without the 
condition (4.9). In this case, we call “a” (or the defining matrix A) 


to be an infinitesimal group element. 

Let us next consider a general n x n dimensional matrix repre- 
sentation of the group G given by the matrix U(a),a € G. If “a” 
corresponds to an infinitesimal group element, then correspondingly 
the n x n matrix representation U(a) will have a form 

N 
U(a)=En+e >> BY X", + O0(e), (4.11) 
pv=1 

for some n X n matrices X‘, which are known as the generators of 
the infinitesimal group and 6", parameterize the infinitesimal group. 
(Namely, there are, in general, N? such n x n matrices. Explicitly 
we can write them, for example, as (X“,);; for w,v = 1,2,---N and 
19 =e) 

For an arbitrary element b € G (not necessarily infinitesimal), 
we note that 


U-+(b)U(a)U(b) = U(b-!)U(a)U(b) = U(b tab). (4.12) 
We can calculate explicitly 


(b-'ab)" + (BT AB)", 


N 
= ((B)*, Av Bt) 
A,p=1 
N 
= ys (B (A +B) a? + O( 
A,p=1 
N 
= By Be +e *)*, BBY, + Oe) 
A=1 A,p=1 


= + Bt, + Oe), (4.13) 
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where we have identified 
N 
BY = (B'8B)", = » Bo) BBY. (4.14) 
A,p=1 
Equation (4.13) shows that (b~ tab) also defines an infinitesimal group 
element. Therefore, using (4.11) we conclude that 


N 
U(b-'ab) = Ente S> BX", + O(e?) 
pv=1 
N 
=E,+e DS) (B)* 6,BtX", + Oe) 
,y,r,p=1 


N N 
yn 
= En +e > BY D> (BY, BY X4) +0(8), (4.15) 
pjv=1 A,p=1 
where we have rearranged the summation indices uw © A, © p in 


the last step. Substituting this into (4.12) and using (4.11) we obtain 


N 
U-"(6)(En +e D> BX, + (8) UO) 


bv=1 
N N 
=E,te >> Bt, ( ye), BY,X*,) +0(e2), 
pv=1 A,p=1 
N N 
or, € SAY (U-*(0)X%,U(0) — D> (BU1)*, BY,X4) =0. 
p,v=1 A,p=1 


(4.16) 


For the case of G = GL(N), the parameters 6", are arbitrary con- 
stants so that, in this case, we conclude from (4.16) that 


N 
U-1(b)X4,U(b) = 5 (Bo1)*, BEX®. (4.17) 
A,p=1 


For G = U(N), we have (see (4.6)) (eee = —6', so that 
Re B",, = —ReB",, Im6", = Imp". (4.18) 


Apart from these constraints, 6" are arbitrary and (4.16) leads to 
(4.17) again. For the case of G = SU(N), in addition to (4.6), we 
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N 
have )*> 6», = 0 (see (4.7)) so that we can require the generators X", 
A=1 


to satisfy (“Tr” denotes summing over the indices u,v only and not 
the matrix trace) 


N 
Tx =} x4 =0. (4.19) 
A=1 


We note that we can always achieve this by redefining the generators 
as 
1 N 
REN a S> x, (4.20) 
d=1 
so that we can consider the constant parameters 8", to be arbitrary. 
(This discussion applies to G = SL(N) as well without the constraint 
(4.6).) As a result, once again (4.16) leads to (4.17) and we conclude 
that (4.17) holds for G = GL(N),U(N), SU(N) (and SL(N)). 
Let us recall from (4.2), (4.4) and (4.11) that, for an infinitesimal 
group element b, we have (note that b > B in the defining N x N 
representation) 


(Bol)*, = 6b — 8", + O(€), 


N 
U(b)=Ente >> BY Xx”, + O(e?), (4.21) 
HV=1 


so that we can write 
N me 
U-"(b) =U(0") = E,—€ SS BX", + O(€?). (4.22) 
pv=l 


As a result, we can calculate 


N 
U-1(b) XU (b) = (Zn —¢« >> BY X*% + O(c) x4, 
A,p=1 


x (En +e 3 cue Game o(e)) 


o,T=1 


N 

=r 

=X" +e) > 8, (X4X% — X*%X4) + O(2) 
A,p=1 
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=X" +e 3 B, [X#,, X°]+O(e?). (4.23) 
A,p=1 


On the other hand, from (4.17) and (4.21) we also note that for an 
infinitesimal group element a 


N N 
U4 =). By Bex SB (Boy. 
A,p=1 A,p=1 


N 
-> (01 +B", + O(e)) KP (a 28 Oe) 
A,p=1 


N 
=KE RCS B (Xe Xe) LO). (4.24) 
A,p=1 
Substituting (4.23) and (4.24) into (4.17) we obtain 
ou » 
S> Bp ([X4%, X4] — RXF, + EXK) = 0. (4.25) 
A,p=1 


As before we can argue that since the parameters B, are arbitrary 
(4.25) leads to 


[X#,, XP] = SEXP, — 6P XH, (4.26) 


for all cases of G = GL(N),U(N), SU(N) (and SL(N)). Relation 
(4.26) describes how the Lie algebra for these groups can be obtained 
from the concept of an infinitesimal group, in particular defining the 
Lie algebra of sl(V). This shows that the generators of the infinitesi- 
mal Lie group constitute the basis elements of the corresponding Lie 
algebra. We may wonder here as to what may be the difference be- 
tween the Lie algebras, say for example, between gl(N) and u(JN). 
To see this, let us next consider the converse problem of constructing 
the Lie group from the Lie algebra. 


4.2 Lie groups from Lie algebras 


To begin with let us consider the Lie group G = U(N) so that any 
arbitrary group element a € G can be represented by a N x N unitary 
matrix A in the defining representation. It then follows that we can 
express 


A= &), (4.27) 
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for some N x N matrix €(,) which is anti-Hermitian, namely, 


Ela) = —€ (a), (Ey * = eas (4.28) 
where pp,v = 1,2,--- ,N. Defining the N x N unitary matrix 
Caen §(a) (4.29) 


for some integer M, we conclude that it must correspond to a group 
element c € G. It follows from (4.27) that we can write 


A= oO", 
U(a) = U(c-c---c) =U(c)U(c):+-U(c) = (U(c))™ , (4.30) 


where U(a) denotes a n x n matrix representation of G = U(N). 
If we now consider M to be a very large integer, then the N x N 
matrix C' in (4.29) becomes infinitesimal and has the form 


1 1 
Ce oe oF + AP av + O(s72): (4.31) 


so that we can write (see (4.11)) 


N 
1 7 i 
U(c) = En + 55 Py Sop Oa): 
bvV= 
M 1 = m yi 1 M 
U(a) = (U(e)" = (En + = 2 ele ,+O(q))* (4.32) 
Y= 


where X“, represent the generators of the infinitesimal group for G = 
U(N) and are n x n matrices (see the comment after (4.11)). As we 
have seen in (4.26) these constitute the basis elements of the Lie 
algebra u(V) as n x n matrices. If we now take the limit M — on, it 
follows from the second relation in (4.32) that we can write 


ss Shy 
Uta) =e"! . (4.33) 


when (4.27) holds and where Gy can be thought of as numbers. 
This shows that a n x n matrix representation of the unitary group 
G = U(N) can be obtained by exponentiating the basis elements of 
the Lie algebra u(N) as n x n matrices. 
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For the case of G = real GL(N), we note that we may not be 
able to represent some of the elements a € G in the exponential form 
of (4.33) (for example, if a + A corresponds to a negative matrix). 
However, let us consider those elements a € G such that we can write 
it as 


avr A=, (4.34) 


for some real matrix €(,). Then, the same reasoning as before leads 
to (4.33). However, the difference between the groups U(N) and 
real GL(N) lies in the fact that for the former, the parameters ciay 


* 
are complex parameters satisfying (4) = i while for the 
latter ee are arbitrary real parameters. As a result, if we are inter- 


ested in an X n unitary matrix representation satisfying 


U'(a)U(a) = En, (4.35) 
the basis elements of the Lie algebras must satisfy (as n x n matrices) 

(xnyt = xe, for U(N), 

(x#) = —XEH, for real GL(N), (4.36) 


since (4.35) has to hold. Thus, we see that, for a unitary representa- 
tion, the basis elements of the two Lie algebras would satisfy different 
Hermiticity properties even though they satisfy the same Lie algebra 
relation (4.26). 

Let us next consider the real orthogonal group (rotation group) 
SO(N) = R(N) discussed in section 1.2.5. In this case, the infinites- 
imal group element can be written as 


a Aw = Ow + By + O(e), (4.37) 
where 
Bu = — Bop, (4.38) 


are real parameters so that the group element satisfies (see (1.67)) 
A’A= En. (4.39) 


Correspondingly, we can write the n x n dimensional matrices as (see 
(4.11)) 


N 
U(a)=Ent = >> Bw Xw + O(?), (4.40) 
2 pv=1 
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where the real n x n matrices X,,, satisfy 
Ng = =a: (4.41) 


This can always be achieved by replacing X,,, — 5(X, eg 
view of (4.38) and the factor 5 in (4.40) is there to avoid double 
counting. 

We can now explicitly evaluate the relation (4.12) for the present 


case. Using (4.40) we obtain 


N 
U-M(b)U(a)U(b) = Ent 5 D> Bu") XU (b) + O(€), 
V=1 
: (4.42) 


for an arbitrary element b € SO(N). On the other hand, because of 
the group property we note again that 


U~!(b)U(a)U(b) = U(b-)U(a)U(b) = U(b- 1 ab) (4.43) 


and we can calculate (as in (4.13)) 


(bab) |, > (BAB), 
N 
= (BT AB) = Ss" (BT) 1» Arp Bov 
A,p=1 
N 
~ (BT )un (Ino + €Brp + O(e’)) Bpv 
A,p=1 
N 
= Op THe S- Brxp Pr Bov fd O(c?) 
A,p=1 
_ Oy Te Buy at O(e’), (4.44) 


where we have idntified (see also (4.14)) 
7 N 
byw = ys: Brp By Bov. (4.45) 
A,p=1 


We conclude from (4.44) that (b~'ab) is an element of the infinitesimal 
group (even when 0 is arbitrary). Therefore, it follows from (4.40) 


108 4 RELATIONSHIP BETWEEN LIE ALGEBRAS AND LIE GROUPS 


that we can write 
U-+(b)U(a)U(b) = U(b- tab) 


N 
€ ~ 
= En + 5 Ss" BuvX pw + O(e?) 


bv=1 
é N N 
2 
=En +5 Dy Buy os; ByrBypXrp) + O(€). (4.46) 
byv= p= 


From eqs. (4.42) and (4.46) we obtain 
N 
U~*(b) XU (6) = ByurBupX rp; (4.47) 
A,p=1 
which can be compared with (4.17). 


If we now we choose the group element 6 to be infinitesimal, 
then we have (see (4.21)) 


Bu = Ov + Bis ot O(e), Ag = Bs 


Buy X pw Ole): (4.48) 


N 
€ re 
= X ww 1 9 y BrolXw, X yo] oF O(c’), (4.49) 
A,p=1 
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On the other hand, we note that 


N N 
SS BurBupXrp = S> (Sux + Bux + O(€?)) Xrp 
A,p=1 A,p=1 


x (O52 + Bro + O(e’)) 


= Xw are S- Bro (Ops 2cap si burxX pv) as O(e) 


A,p=1 
n 
E€ — 
= Xw — 9 S- Bp (OurXvp + OvpX pr — OupXvr— dvr»X pp) 
A,p=1 
EOE), (4.50) 
where we have used the anti-symmetry properties Bi — —Bupy as 


well as X,, = —X ,. Substituting (4.49) and (4.50) into (4.47) we 
conclude that in the present case we have the Lie algebra of the form 


[Xpv, X xo] — —6u,»Xvp -e OvpX pr a OupXvr oF OvrX pps (4.51) 


which can be compared with (3.24). 
Note that if we write an arbitrary rotation (not necessarily in- 
finitesimal) corresponding to the group element a as 


A=e&, ATA=En, (4.52) 


for some real N x N matrix €(q), then we can determine as in (4.33) 
that the n x n matrix representation can be written as 


5 ss ba LX, v 
ase ua (4.53) 


where X,,,, denote the basis elements of the Lie algebra. The unitarity 
of the representation U'(a)U(a) = Ep requires that 


4 ee (4.54) 


since €(q),, can be thought of as real constant parameters. Namely, 
the generators, in this case, will correspond to anti-Hermitian n x n 
matrices. This is the reason why it is often more convenient (and is 
commonly used in physics) to define 


Juv = Xp, (4.55) 
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which are Hermitian, Thy = Juv. 

For the case of the symplectic group Sp(2N), as we have seen 
in (1.103), we can write the matrices A in the defining 2N x 2N 
representation (corresponding to the group element a) as 


A’ enA = enn, (4.56) 


where €2y denotes the 2N x 2N dimensional anti-symmetric matrix 
defined in (1.101). The infinitesimal group element, in this case, can 
be written as 


Aw = bw + (Bw + Oe”), HYv= Ledges ,2N, (4.57) 


for some 2N x 2N matrix 6,,. The constraint (4.56) in this case, 
namely, A? €2v A = eon leads to (as a matrix relation) 


eanB + B eon = 0, 
or, (€2v 8)" = (en), (4.58) 


where we have used the fact that ae = —€)n (namely, it is anti- 
symmetric, see (1.102)). Correspondingly, we can write for an x n 
matrix representation 


2N 
e 
U(a) = Fon + 9 Py (ENP) wy Xw + O(e?), (4.59) 


where X,,, denote 2n x 2n matrices corresponding to the generators 
of the infinitesimal group which can be chosen to be symmetric, 


Xpw = Xyp; (4.60) 


in view of relation (4.58). We can now conclude from an analysis 
similar to what we have discussed earlier that the generators satisfy 
the Lie algebra 


[Xpv, Xrp] = €2n,praXvp + €anyrXpp — €2N,rAuX pv — €2N,vX pps 
(4.61) 


for p,V,A,p = 1,2,--- ,2N. 

Finally let us discuss the case of the Abelian group for which 
any two elements commute, namely, ab = ba,a,b € G where G is a N- 
dimensional Abelian group (for example, the N-dimensional transla- 
tion group T(NV)). Since the group elements commute, their defining 
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representation can be written in terms of N commuting parameters 
Bu, = 1,2,--.,N and we can write an infinitesimal group element 
(in any other representation) as a n x n matrix of the form 


N 
U(a)=En+e> = BuX,+ O(c’), (4.62) 
p=1 


for some real or complex parameters {,,. Here X,, denote the gener- 
ators of the infinitesimal group and as before we can deduce the Lie 
algebra of the generators. In this case, the commuting nature of the 
group elements leads to 


Pooky = 0, HYv= T2 5+ as gi (4.63) 


namely, the Lie algebra is Abelian in this case. As an illustration, let 
us consider the 2 x 2 matrix representation of the one dimensional 
translation group T; (see (2.104)) 


1 0 
U(ay= (; ) : (4.64) 
In this case, we can write 
U(a) =e, (4.65) 


where the generator of the infinitesimal group (or the Lie algebra 
element) is given by 


ee a 5) ee Se. (4.66) 


The Lie algebra is manifestly Abelian. 


4.3 Baker-Campbell-Hausdorff formula 


In the last section we obtained, in a circuitous way, the different kinds 
of Lie groups from their corresponding Lie algebras by exponentiating 
the basis elements of the Lie algebras. This can also be obtained more 
directly through the use of the Baker-Campbell-Hausdorf formula in 
the following way (see, for example, M. Hauser and J. T. Schwarz, Lie 
groups and Lie algebras; notes in mathematics and its applications). 

Let A denote an associative algebra such as the matrix algebra 
U(L) (namely, matrix representation of a Lie algebra L). For any 
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x,y € A, we can express 
ered = eh (ty), 
or, (x,y) =In(e*e!), (4.67) 


where the solution of (4.67) for ¢(a,y) € A is known as the Baker- 
Campbell-Hausdorff formula. The low order terms in ¢(x, y) can be 
determined explicitly as follows. Instead of (4.67), let us look at the 
expression 


eltety — ebltry) (4.68) 


where t is a small parameter. We can now make a Taylor expansion 
as 


o(t, x, y) ye on(2, y)- (4.69) 


On the other hand, we note that we can write 


Ob.t; a) =n (ener) = In (4+ (e*e'Y — 1) ) 


= yore et 1)" = Pon (gg). (4:70) 


n=l], 


Therefore, matching the powers of t, the function ¢,(z,y) can be 
calculated order by order and has the form 


da(e.y) = > EU" (ere - 1)" 


m=1 


(4.71) 


n 


where the restriction implies that the sum of powers of x and y should 
add to n in the expression on the right hand side. We can now 
calculate explicitly the first few orders of the function ¢,(x,y) in a 
straightforward manner, 


bo(z,y) = 90, 
olay) =0+y, 
oy ye 4 ae ot 
go(z,y) = taut > — slaty) = 5leyl, 
és(2.y) = zole.leull ~ aol be all (4.72) 
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and so on. 

However, such a calculation of ¢,(x, y) becomes extremely com- 
plicated as we go to higher orders. On the other hand, the solution 
of (4.67) can be written compactly as (the derivation is technical) 


1 
o(x,y) =a“2+ / ds w Cee y. (4.73) 
0 


Here (adz) is the adjoint operation defined in (3.103), namely, (adx)y = 
[x,y] and 


vw) 


I 
€ 
Dp 
& 
Ww 
I 
—-~ 
ra 
+ 
Se 
a 
p 
‘Pe 
| 
ah 
| 
€ 
a 


n=1 
[o-e) 1 [o-e) 
=e ye (1—w)” 
smart) d n+1 
= z nce: lw (4.74) 


Here w(w) is known as the generator of Bernoulli numbers in the 
sense that 


b(e*) = S$" Bn s (4.75) 


n=0 


where the coefficients B, denote Bernoulli numbers, namely, By = 
1B, =3,B,=%,Ba=-H.° 

Because of the definition of the adjoint operation, $(z,y) is a 
sum of a series of commutators (and commutators of commutators 
etc.) involving x and y. The first few terms of the series are given 


by (see also (4.72)) 


alt be-all ole bea) +- - (4.76) 


(zy) =xt+y4 slo 


As a special case of the Baker-Cambell-Hausdorff formula, let us con- 
sider the case when [x, y] = cl, where c is a constant, or more gener- 
ally 


[x, (2, y]] =O = [y, [x,y]. (4.77) 
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In this case, all the higher commutators vanish and we have 


d(t,y) =a2t+yt+ sl. yl, (4.78) 


and we obtain 


ered — ey) — ortytglesl — oglzal ott+y. (4.79) 


For example, if we identify 


r=iag, y= ip, (4.80) 


with a, (in general, complex) constants and q,p denoting the co- 
ordinate and the conjugate momentum satisfying the canonical com- 
mutation relation [g, p] = ih1, we have 


[x,y] = [iag, i8p] = —aB[q, p] = —thaB1. (4.81) 


In this case, using (4.79) we have 


ciedeibp — o~“*S piagtisp, (4.82) 


Similarly, for annihilation and creation operators a, a! (of the bosonic 
harmonic oscillator) satisfying [a,a'] = 1, we obtain 


t 1 “ t i t 
E24 eba —e5 ap coat Ba : cha ee = 73 aB coat Ba : (4.83) 


These relations can be equivalently written as 


t i t 
exer Pa ap ee ext 


— e2 = or ap oor eBal (4.84) 


These relations are relevant in applications in quantum optics (in 
connection with coherent states). 

If U(L) denotes the matrix representation of a Lie algebra L, 
then we note, from (4.76) (as well as the properties of a Lie algebra), 
that for any x,y € U(L), we have $(z,y) € U(L). Therefore, if we 
define 


G = e¥) = {the set consisting of eV}, (4.85) 


then this implies GG € G so that a product of the elements e*e¥ 
is well defined and is contained in the set G. Clearly e® = 1 is the 
unit elelemnt with the inverse e~* satisfying e-*e* = 1 = e*e™”. 
Therefore, the set of G consisting of all e” forms a group. This 
procedure of obtaining a group is known as exponentiation of the Lie 
algebra. 
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> Example (Construction of unitary representation). As we have mentioned above, 
a representation of the Lie group can be obtained from a given Lie algebra through 
exponentiation using the Baker-Campbell-Hausdorff formula. However, the ex- 
plicit construction is, in general, quite difficult. Here we discuss such a construc- 
tion for a simple example. 

Let a,b,e denote three operators satisfying the commutation relations 


[a, b] = —[b, a] =e. 
[a, e] = —[e, a] = 0, 
[b, e] = —le, b] =0. (4.86) 


It can be checked easily that the commutation relations satisfy Jacobi identity. 
For example, 


[a, [a, b]] + [a, [b, al] + [6, [a, a]] = [a, e] — [a, e] +0 = 0. (4.87) 


Since the commutators are anti-symmetric and satisfy Jacobi identity, the ele- 
ments a, b,e define a 3-dimensional Lie algebra. It follows now that 


U(a, 8,7) = erat Phe, (4.88) 


generates a Lie group for real or complex constant parameters a, 6,7. 
To see this, let us define 


X;=aia+ $ib+ ye, 


X92 = aoa Bab y2e, (4.89) 


which leads to 

[X1, X2] = a1 62[a, b] + a261[b, a] = (a1 G2 — a28i)e. (4.90) 
As a result, we have 

[X1, [X1, X2]] = [Xe, [X1, X2]] = 0, (4.91) 
as well as higher commutators in (4.76) and we can apply (4.78) to write 

er! eX? — oki tXo+9[X1,X2] 

= plortar)at(B1+82)b+(yatI2t 3 (1 B2-a281))e _ Xs | (4.92) 

where 

X3 = a3a + 83b+ 43e, (4.93) 
with 


1 
azg=aitar2z, ~=hith, w=ynt+re4 5 (0182 a2 1). (4.94) 


Equivalently, we can write 
U(ar, 61, 71)U (a2, B2, 72) = U(as.Bs, 73), (4.95) 
We note from the definition (4.88) that 


U(0,0,0) = 1, (4.96) 
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and since by assumption the product of elements U(a, 8,y) is associative, the set 
of all elements forms a (Lie) group. However, in physics the unitary representation 
of a group is important, namely, we require 


U' (a, 8, y)U(a, 8,7) = 1, (4.97) 


and we discuss two examples of how the unitarity condition can be implemented. 


Case I: Let us assume that 

b=al, e=1, (4.98) 
so that (4.86) can be written in this case as 

[a,a"] =1, (4.99) 


with all others vanishing. As a result, we can think of a, a! as the annihilation and 
creation operators of the bosonic harmonic oscillator and the unitarity condition 
(4.97) can be satisfied if we choose 


B=-o", y= -+. (4.100) 


Therefore, we can write y = iA for a real constant parameter and the group 
element (4.88) can be written in the form 


Uap Veer oe, (4.101) 


which is manifestly unitary. The group elements act on (harmonic oscillator) state 
vectors given by 


|0), n=0,1,2,--- ’ (4.102) 


where the vacuum state |0) satisfies a|0) = 0. 
Case II: As a second example, we can consider 


at=a, bl=b, ef e. (4.103) 


For example, we can identify a = x,b = p,e = ihl1 so that the Lie algebra in (4.86) 
can be written as 


[x,p] = hl, (4.104) 
with all others vanishing and a group element (4.88) has the form 

U(a, B,y) = ett TOP teat (4.105) 
From (4.103) we note that «1 = x,p' = p so that if we choose a* = —a,§* = 
—B,y* =7, or equivalently 

U(a, 8,7) = elle tertin), (4.106) 


with a, 8,7 real, the group element is formally manifestly unitary. 

In quantum mechanics x denotes the coordinate operator while p represents 
the momentum operator, both of which are non-compact. Therefore, properties 
such as Hermiticity or unitarity depend on the space of functions on which the 
operators act. In the present case, the operators (x as well as p = —ihs) act on 
functions f(a) with the coordinate taking values —co < x < oo. However, as de- 
scribed in chapter 2 (see (2.112)-(2.123)), the function f(a) has to be differentiable 
and has to vanish (or oscillate to zero) asymptotically. 

< 
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> Example (Combination of rotations). Although the Baker-Campbell-Hausdorff 
formula is too complicated for general application, in particular cases they can be 
used in a simple manner. We have already discussed one such simple case in the 
last example. Here we will describe how to combine rotations involving both the 
compact group SO(3) as well as the non-compact group SO(1, 2). 

Let us start with the compact group SO(3) with the Lie algebra given by 


[Ji, J;] — leigkIk, 4,9, k= 1, 2, 3, (4.107) 


where €;;~ denotes the three dimensional Levi-Civita tensor (and repeated indices 
are summed). As we have already seen, for SO(3), the generators J; are Hermi- 
tian so that Ji = J;. Following our discussions in this chapter, we note that a 
general group element of SO(3) can be written as e’%'7i where repeated indices 
are summed and the constant parameters 9; describe the components of rotation 
along the three different axes. 

We consider next a product of three group elements of the form (see (4.82) 
or (4.83)) 


e+ eBI- giiJs_ (4.108) 


where a, 8,7 are, in general, real or complex constant parameters and, as before, 
Jz = Ji tide. We note that since the product of group elements gives rise to a 
group element, we can, in general, write the product in (4.108) as 


, (4.109) 


e+ efi V3 — efi ti 


where 6; = 0;(G, 8,7). 

To determine the parameters 6; (and, therefore, the product in (4.109)) we 
note that the formula (4.109) holds for any N x N matrix representation of SO(3) 
and, in particular, for the 2 x 2 representation where the generators are related 
to the three Pauli matrices as 

Jj = 5% 
so that we have 


(0 1 ee dy TE, 6 
n= (9 a) Rema a n= 5o=3 (i 21). @at9) 


We note from (4.110) that in this 2 x 2 matrix representation 


Sas se (4.111) 
which leads to 
Bar he ery ed come) Me aes ee ae en (4.112) 
+ 0 1)? - Bol 
Furthermore, since J3 = 403 is diagonal and o3 = 1, it follows that 
7 y_(eF 0 
iq J3 Pree _ 
—1 Lio: oan ee 4.11 
e€ cos 5 + tos sin 5 ( ; 3) ( 3) 


With these simplifications we determine the left hand side of (4.109) to be 


- G + aB)e™ — . nee 


118 4 RELATIONSHIP BETWEEN LIE ALGEBRAS AND LIE GROUPS 


On the other hand, to evaluate the right hand side of (4.109), we note that 
we can write 


err aga Oe, (4.115) 
where 0 = \/0? + 63 + 63 denotes the magnitude of rotation and 6 = 4(0:% + 


O29 + 632) represents the unit vector around which the rotation is performed. 
Using the properties of the Pauli matrices we can now determine 


ei — 1 cos ras i0-osin 
_ (cosa t Se sing MGM sing (4.116) 
i(O1 4460) sin $ cos $ = ia sin 3 
Comparing with (4.114) we can determine 
cos $ = ; (1+ aB)e? +e?) F 
4 0 iF 
uc 7 102) sin 5 = ae 2, 
i(O1 + 102) 6 = 14 
= 2 
9 ga Res 
sing =—3 ((1+a8)e? —e?). (4.117) 


Here we have combined a product of group elements into a single element 
and consequently have determined 6; = 0;(@, 8,7). However, the converse proce- 
dure of separating a group elements into a product of group elements (in a normal 
ordered form), as we have described in (4.83) and (4.84), is often more useful. 
Here we will discuss this procedure in connection with the non-compact group 
SO(1,2). As we have already discussed (see (3.81)-(3.91)), the Lie algebra of 
so(1,2) is isomorphic to the angular momentum algebra (assuming that the alge- 
bra is defined over complex fields which we are assuming in our discussions) where 
not all the generators are Hermitian. The algebra can, in fact, be described in 
terms of the creation and annihilation operators of the one dimensional oscillator 
in the following manner. Let us choose 


J3 = 


Nile 


1 i i 
(alas 5) » = 5 (a)?, J_= a’, (4.118) 


with [a,a'] = 1. (In the language of (3.57), we can identify A’, = $(a'a+1/2) = 
—A’%,, Al, = t(at)?, Ay = ia?) Then, it can be checked that the generators 
J3, J+ satisfy the angular momentum algebra 


[J3, Jz] =+J4, [J+,J-] = 23. (4.119) 


Although the algebra is the same, as we have mentioned, the difference from the 
case of so(3) arises from the fact that here not all generators (basis elements) are 
Hermitian. In particular, we have 


(GS 292, (4.120) 
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so that the generators Ji, J2 are not Hermitian 


long Dee) pee A 
A= 5 (J++ 5-)=3 (2+ @)) = J, 


=> (Jy —J_) = AG (at)?) = a (4.121) 


In fact, we note that in this case, the quadratic Casimir operator is given by 


h=P=R+B+B 


=-—1, (4.122) 


where we have used the commutation relations between the annihilation and cre- 
ation operators. Recalling that the eigenvalues of the Casimir operator (for angu- 
lar momentum) can be written as j(j + 1) (see (3.122)), in this case, we have the 
unusual value of 7 = —t. In fact, this case corresponds to an infinite dimensional 
unitary representation of the non-compact group S'O(1, 2) for which x2 — a? — «3 
defines the invariant length (see also discussion following (3.90)). 

For applications in some branches in physics including quantum optics, let 


us see how a group element of the form 
ener (4.123) 


can be decomposed into a normal ordered product (with respect to the creation 
and annihilation operators, namely, in a form where the creation operators are on 
the left of the annihilation operators) of group elements as described in the earlier 
example. To do this, let us rewrite 


e941 +92 J2) _ ed (81-102) Jy + (01 +102) J ) 


= eo ¥ (01 -182)(at)?—§ (01 +182)0? (4.124) 


Comparing with the right hand side, we note that this case corresponds to 63 = 0 
and 6 = \/6? + 62. Therefore, we can write (see (4.109)) 


(91 J1 +82 J2) a 3 (01-102) J4 + (01 +102) J ) = ett ef s- e773 


3 (01-82)(ab)?— 5 (O1+i02)a? _ AR (al)? a? A (alats) 


or, e ; 


or, ef ON+ga? _ (Fal? (Fa? 62 ahaa). (4.125) 
where we have defined 
_ i(01 + 162) _ i(01 — 102) = 88 
q = EE pa SE as =-. 
Comparing with (4.117) we obtain 


_ (01 — 162) 0 B 

Qa 7 tan 5 = tan ./—ap, 

a i(O1 + 102) 5 _ il a . 

B 30 sin 0 3 \/ —B sin 2,/—aB, 

Fy = 2ilog (cos V-aB) F (4.127) 


(4.126) 
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Note that acting on an energy eigenstate |n) of the harmonic oscillator, the last 
factor on the right hand side of (4.125) is a constant so that the operators are 
indeed normal ordered. Relation (4.125) can be thought of as the analog of (4.84) 
for the group SO(1, 2). 

< 


4.4 Ray representation 


In quantum mechanics, the Schrédinger wave function ~(x) has a 
phase ambiguity in the sense that both w(x) and w(x) = e w(x), for 
a real constant 0, represent the same physical state (since the proba- 
bility density p(x) = |w(x)|? is independent of the phase). Therefore, 
for applications in physics, we can also allow a phase ambiguity in 
the representation U(g) of a group G in the composition relation (see 


(2.2)) 


U(g1)U (92) = e') U (gig), Vor, 92 € G, (4.128) 


for some real phase function w(g1, 92). If w(g1, 92) = 0 for all group 
elements, then it reduces to the standard representation of the group. 
However, when w(g1, 92) # 0 for all the elements of the group, (4.128) 
leads to a ray representation of the group G. We note that U(g) has 
to satisfy the associative law (see (2.6)) 


(U(91)U (92) U(9s) = U(g1) (U(92)U (93) ; (4.129) 
and since (4.128) leads to 


(U(g1)U (g2)) U (93) = e(9-)U (g1.92)U (93) 
= et(91,92) +4(91.92:93) [7 (G4 gags), 
U(g1) U(g2)U (gs) = e999) U (gi U (gags) 
= et(92,93) +4(91:9293) [7 (G1 gags), (4.130) 


for a ray representation we must have 


w(9192,93) — w(91, 9293) = w(g2, 93) — w(g1, 92): (4.131) 


w(g1,g2) is called the cocycle of the group G and (4.131) is known 
as the cocycle condition. (In deriving (4.130) we have assumed the 
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associativity of the group composition (g1g2)93 = 91(9293) = 919293 
as in (1.2).) 
Let us next define a new function 


V(g) = &*U(g), (4.132) 
where 6 denotes a real constant parameter. (We can view (4.132) as 


a group transformation without specifying its physical significance.) 
The new functions would satisfy the composition relation 


V(g1)V (go) = eb) +92) 7 (g JU (go) 
= e'(9(91)+8(92)+(91,92)) U(gig2) 


= VY (gi 9p), (4.133) 


where we have used (4.128) and have identified 


w(g1, 92) = w(g1, 92) + O(91) + (ge) — O(9192). (4.134) 


It follows now that if the cocycle w(gi, g2) satisfies 


w(g1, 92) = O(gig2) — (91) — O(g2), Von, g2 © G, (4.135) 


for some 6(g), then, we will have (see (4.134)) 


@(91,92) = 0, (4.136) 


so that V(g) would lead to the standard group representation of G. In 
this case, the cocycle is called trivial. For some non-compact groups, 
the study of unitary ray representations is essential in applications 
in physics. A familiar example arises in the study of representations 
of the Galilean group (see Bargmann). However, it is much easier, 
in general, to study the ray representation of the Lie algebra rather 
than that of the Lie group. On the other hand, this is achieved at 
the expense of modifying (extending) the standard Lie algebra in a 
manner which we illustrate with the following example. 

Let us consider the case of 2-dimensional translation group T> 
(T7(2)). As in the case of Ti, a unitary representation of T> can be 
given by (see (2.72) and (2.76)) 
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0 0 
U(ay, a2) = exp (age + a5] , (4.137) 


of two real variables x; and x2 with two real parameters a, and ag 
operating on functions f(21,22) with norm given by 


[IF 1I? = / dard | f (x1, x2)|? < 00. (4.138) 


—oo 


The corresponding Lie algebra consists of the basis elements 


Di — Do = ; (4.139) 


which satisfy the Abelian Lie algebra 


[D,,.Ds)'=0. (4.140) 


To obtain a unitary ray representation of 75, the simplest ap- 
proach is to consider again the annihilation and creation operators a 
and a! respectively (of the one dimensional harmonic oscillator) satis- 
fying (see (2.130) in connection with the one dimensional translation 
where we have identified a > a and so on) 


[a,a'}=1, [a,a] =0= [al al). (4.141) 
Let a denote a constant complex parameter and let us consider the 
transformations 

avd =a+al, atoat=al+a°*l, (4.142) 
where a* denotes the complex conjugate of a. It follows from (4.141) 
and (4.142) that 

[a’jat}=1, [a’,a]=0= [at a4), (4.143) 


so that we obtain a representation of the corresponding three dimen- 
sional group through the relation 
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U(a) = exp (a*a — aa) ; (4.144) 


which formally corresponds to a unitary representation. (This is also 
invariant under the transformation (4.142).) 

We note that for a different constant complex parameter 6, we 
can write 


U(6) = exp (s*a — Bat) : (4.145) 


If we make the identifications 


A=a*a—aa', B= f*a-— Bal, (4.146) 


using (4.141), we obtain 


[A, B] = [a*a — aal, B*a — Bal] = — (a*B — af") 1. (4.147) 


Furthermore, using the Baker-Campbell-Hausdorff formula, namely, 


‘li 
ApB — A+B Q$IA.B] 


Ce = = ¢2/4.8) enne, (4.148) 


when [A, B] is a constant and is proportional to the identity operator, 
we now obtain 


U(a)U(B) = e4e? = e2l4.B] A+B _ piv(ab) U(a+ 8), (4.149) 


where we have identified 


a * * 
w(a, B) = 5 (a"B— af"). (4.150) 
Thus, (4.144) truly furnishes a unitary ray representation of the three 
dimensional group. Moreover, the connection of this representation 
with that of Tz can be made if we make the identifications (see (4.137) 


and (4.139)) 


1 
a@=a,—tia,, a=D= 5 (D, — iD). (4.151) 
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where ay = Rea,a2 = Ima. Namely, we can rewrite (4.144) as 


U(a) = exp (a, D, + a2D2). (4.152) 


However, D, and D2 cannot commute any more (see (4.140)) since 


[a,a"] =[D, D'] = 1, (4.153) 


which translates into the commutation relation 


(D1, Dy] = 2i1. (4.154) 


As a result, the Lie algebra, in the present case consists of three basis 
elements D;, Dz and 1. The addition of an extra element (in this case, 
1) to the standard Lie algebra of a Lie group, with some modification 
in the commutation relations, to obtain a ray representation for the 
group is quite general. When the extra element commutes with all 
the other elements of the algebra, it is known as the Abelian extension 
or the central extension. 
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CHAPTER 5 
Irreducible tensor representations and 
Young tableau 


In chapters 2 and 3, we have constructed simple representations of 
various Lie groups and Lie algebras. In this chapter, we will discuss 
how more complicated representations can be constructed and how 
the Young tableau or the Young diagrams (named after the British 
mathematician Alfred Young) prove to be of great help in this direc- 
tion. For simplicity we will restrict ourselves to the U(N) and SU(N) 
groups. 


5.1 Irreducible tensor representations of the U(V) group 


Let us change notations (compared with, say, chapter 2) and write 
a nonsingular N x N matrix in this section as (namely, we are rep- 
resenting the matrix A in the defining (fundamental) representation 
also by the same symbol as the group element a to avoid proliferation 
of symbols) 


ag a “a? a,” 

Weil G44, o Ge. te (5.1) 
1 2 3 N 
an an an an 


with aj, = a,! (see, for example, (2.18) and (2.21) where a;; was 
denoted as a;; to avoid confusion with the annihilation and creation 
operators) for reasons to become clear soon. We note that a general 
linear transformation of an N-dimensional covariant vector can now 
be written as 

ie Nee TT (5.2) 
where u,v = 1,2,--- , N and summation over repeated indices (v) is 
understood. We have already noted (see the discussion at the end 
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of section 1.2) that the general linear group GL(N) consists of the 
set of N x N nonsingular matrices a (namely, deta 4 0 so that a7! 
exists). Therefore, we can denote an N dimensional representation 
of GL(N) by the N x N dimensional matrices (the meaning of the 
subscript “1” will become clear within the context shortly) 


Lyla) =a: (5.3) 


In this case, Uj(a) may be regarded as describing a linear transfor- 
mation of the covariant vector x, in this N-dimensional space by 
(repeated indices are summed) 


Ui(a): «> Uy(a)z, 


or, 24> (Ui(a)z), = 2), = 4,) ay. (5.4) 


It can be easily checked, as in chapter 2, that these matrices 
satisfy all the properties of a group and, therefore, provide a repre- 
sentation. For example, 


(Ur(e)x)y = (Ln) ftv = 6," tv = Xp, (5.5) 


so that we can identify U;(e) = Ey = 1y. Similarly, we have 
(Ui(a@)Ui(6)a)y, = (Ui(a) (U1(0)@)),, = a, (U1 (b)2), 
= 4,/b) xy = (ab) 2, = (Ui(ab)z),,, (5.6) 


where we have used the usual definition of the matrix product (ab) i, = 
ay b,. (with the convention that repeated indices are being summed). 
This then implies that U;(a)U1(b) = Ui(ab) and that the matrices de- 
fine an N x N dimensional matrix representation (or simply an N 
dimensional representation) of GL(N). This is conventionally known 
as the defining representation or the fundamental representation for 
any group and is obtained by simply considering the linear transfor- 
mation of the covariant vector x,,. (Namely, the covariant vector x, 
provides the simplest representation space for U(N) (or SU(N))) 

However, we are interested mostly in the groups U(N) and 
SU(N) and we recall that the U(N) group is defined by the set of all 
N x N nonsingular matrices a satisfying the unitarity condition (see 
(1.48)) 


ala = Ey = aa’. (5.7) 
Furthermore, if we impose the additional condition 


deta =I, (5.8) 
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on the matrices they lead to a representation of the group SU(N). 

Just as (5.4) defines the transformation of a covariant vector, 
we may similarly consider an N-dimensional contravariant vector «4 
which transforms inversely as 


t# > (U_s(a)a)* =a =a" (a) = (a) 2”. (5.9) 
As in (5.5)-(5.6) we can check that 
Usa(a) =a, (5.10) 


also defines a representation (since a is assumed to be nonsingular 
a‘ exists). On the other hand, (5.7) for U(N) and SU(N) implies 
that 


Cac) ie Crae (5.11) 


so that we can write (5.9) also as 
a> Ualaa ye Sa" = (ar) av. (5.12) 


Comparing with (5.4) we see that this allows us to identify the con- 
travariant vector as the complex conjugate of the covariant vector, 
namely, 


oS (gy) (5.18) 
so that we can also identify 
U_i(a) =(Ui(a@))", (5.14) 


to be the complex conjugate representation of the defining represen- 
tation U;(a). This is also consistent with (5.10) together with the 
unitarity condition (5.7). 

We can now construct more complicated representations as fol- 
lows. Let x, and y,, be two N-dimensional covariant vectors. Then 
the second rank covariant tensor Ty, = %pyv, f,Y = 1,2,---,N 
transforms under U(N) or SU(N) as (a, 6 = 1,2,--- , N) 


Tye Pa Tap, (5.15) 


Since T),, has N ? independent components, the transformation should 
generate a N?- dimensional or a N? x N? matrix representation of 
U(N) or SU(N). This is easily seen by recognizing that we can ar- 
range the N? components of the tensor T),, into a column basis vector 
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of the form T4 = Ti,,,),A = 1,2,--- ,N?, where, say for example, 
Ty = T1172 = Tha,--- Tw = Tin, Tg = Ta,--: ,Ty2 = Tn so 


that the transformation matrix (summation over B = 1,2,--- , N? is 
understood) 
T, > Ugp(a)Tzg, Uag(a) = U (u,v)(a,8) (2) ae aa. (5.16) 


isa N?x N? matrix. We can readily verify the validity U3 (a)Upc(b) = 
Uac(ab) (repeated indices are summed) so that this forms a repre- 
sentation of the group. 

However, it is easy to show that such a representation is not irre- 
ducible in the following way. Let us define, from T,,,,, the symmetric 
and the anti-symmetric tensors S,,, and A,,, respectively as 


1 
Suv = 3 (Tuy + Trp) = Sop, 
1 
A= 5 hie = As (5.17) 


so that we can also write 
pea per ee (5.18) 


These tensors also transform as in (5.15) which can be seen from 


1 
oe = 5 (ear Ta) > 5 aa Tap + 3 aera em 
1 
= 4,4, 3 Fas + Ta) 


= 6," as Sap% (5.19) 
Similarly, we can show that 
Aw > 0,0 a, Aag. (5.20) 


Therefore, both S,, and A,, also provide representations for the 
group of dimensions $ N(N-+1) and $ N(N-1) respectively. Namely, 
Sy, acts on vectors in a 5 N(N +1) dimensional vector space V; while 
Ay, acts on vectors in a 5N (N — 1) dimensional vector space V3. 
Furthermore, V; and V2 define invariant spaces, namely, under the 
transformation, the vectors in these two spaces do not mix (see, for 
example, section 2.2, particularly, the discussion towards the end of 
that section). If we write the basis vectors representing T),, in terms 


of the corresponding basis vectors of S,,, and A,,, then these basis 
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vectors would define invariant subspaces of T,,, so that T),, is fully 
reducible as also can be seen from (5.18). Counting the dimensions 
of the representations we see as well that 


N? = SN(N+1)4 5 N(N - 1). (5.21) 


Furthermore, the representations given by S,,, and A,, can be shown 
to be irreducible, namely, there exist no other nontrivial sub-tensors 
(of lower dimension) of Vj and V2 which are invariant under U(N) (or 
SU(N)). Therefore, this shows that U(N) (or SU(N)) has two irre- 
ducible (second rank covariant tensor) representations of dimensions 
5 N(N +1) and § N(N — 1). 

For the mixed tensor T),” = x,y”, the transformation property 
under U(N) (or SU(N)) is obtained to be (see (5.4) and (5.9), re- 
peated indices are summed) 


W\ VU -1)\ / B 
Ly SL), =a (ae )y Ter (5.22) 
and yields a N? x N? matrix representation. However, as before this 
representation is not irreducible. To obtain the irreducible represen- 
tations, we note that we do not have a symmetric and anti-symmetric 
decomposition for a mixed tensor. Rather, let us define (summation 


over repeated indices is assumed) 
Oat a (5.23) 
which is invariant under U(N) (SU(N)) since 


a (Gaars Te 


ToT =(T') h=a, 


=a," (ane) To = 53T,° 


ap og (5.24) 
Namely, T provides a trivial one-dimensional representation of U(V) 


(SU(N)). 


As a result, we see that 


1 
De ae (5.25) 


transforms exactly as T,,” under U(N) (SU(N )). However, it satisfies 
the traceless condition (recall that pp = 1,2,--- ,N), namely, 


~~ 1 
The (x," a! r) Se ee a (5.26) 
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Consequently it defines a (N? —1) dimensional matrix representation 
of U(N) (SU(N)). Furthermore, from the definition in (5.25) we note 
that 

TY =T ter (5.27) 

HL as a 
leads to the correct number of dimensions for the representation of 
T,,”, namely, N? = (N?—1)+1 as expected. We can verify that the 
traceless tensor T, LP is irreducible. 

The situation becomes more complicated for tensors of larger 
rank. As an example, let us consider a tensor of the form T,,,, = 
LyYvz Which gives a N 3_dimensional representation space. This 
space is again reducible and the decomposition of this space into 
invariant tensor spaces is more involved as can be seen in the follow- 
ing way. Let o denote a generic permutation of the indices u,v and 
A such as wu > vy > A> pw. Let us define the two third rank tensors 
by summing over all possible permutations as 


Suvr = = 31 a a(p)o(v)o(A 


1 
=a (Tin + Typ + Dy ww oF Tyr =F Td a Troy) ’ 
3! 


1 o 
Aw = 31 ae To(u)o(v)o() 


1 
3! 


where the weight factor (—1)° introduces a negative sign for every 
pair of indices that are interchanged. We note that S,,,, is mani- 
festly totally symmetric (under the exchange of any pair of indices) 
while A, is manifestly totally anti-symmetric. The total number 
of independent components of these tensors are given respectively by 


D(S) = (N +1)(N +2), 
D(A) = 7 N(N L)(N — 2), (5.29) 


and we note that D(S) + D(A) = 3 N(N? +2) < N° (equality holds 
only for the trivial case N = 1), so that these two classes of tensors 
alone cannot reproduce the whole set of T),,,. We need to consider 
more complicated tensors other than the familiar classes of totally 
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symmetric and totally anti-symmetric tensors. For this we proceed 
as follows. 

Let us consider a third rank (mixed symmetry) tensor M,,, 
which is obtained from T),,, by first symmetrizing in the first two 
indices (4,v) and then anti-symmetrizing in the first and the last 
indices (ju, A) as 


1 
Mur = 3! (Twa + Ties) ~ (Drop a Tx) : (5.30) 


This tensor satisfies the symmetry relations 


Myr = —Myvp, 
Mw» oe Myr 2 Mw = 0. (5.31) 


Note that the first relation in (5.31) says M,,,, does not contain 
the totally symmetric tensor S,,,, while the second relation (together 
with the first relation) implies the absence of the totally anti-symmetric 
tensor A, in My, . 

Similarly, we can construct another third rank (mixed symme- 
try) tensor M ioe which is obtained from T),,, by first symmetrizing 
in the (wA) indices and then anti-symmetrizing in the (juv)indices. 
More explicitly, this tensor has the form 


1 
jr = 3p (Tyr + Trp) — Tour + Tryv)) (5.32) 


Correspondingly, this tensor satisfies the relations 


a 77 =Mi ax 
Mir + Moxa + My = 0. (5.33) 


Both M,,, and M tos have the same number of independent compo- 
nents given by 


D(M) = D(M’) = 5 NN? ~ 1). (5.34) 


It is easier now to check that the tensor T),,, can now be expressed 
uniquely in terms of these four third rank tensors as 


Tir = Onn + Avigh + (Myr a Mtv») 9 (Miu a Moxy) (5.35) 
leading to the correct number of total dimension, namely, 


N® = D(S) + D(A) + D(M) + D(M’). (5.36) 
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Let us note here that although we constructed the (mixed sym- 
metry) tensor M,,, by first symmetrizing in the (yw) indices and then 
anti-symmetrizing in (WA), the operations are commutative. Namely, 
we could have started by anti-symmetrizing in the (4A) indices and 
then symmetrizing in (juv”) and the final result would not be the same, 
but would lead to an equivalent result in the following way. For ex- 
ample, constructed this way, the (mixed symmetry) tensor can be 
written as 


Myr = x ((Tyvra — Trop) + (Topa — Try) (5.37) 
which satisfies the relations 

Mar = Mi, 

Myr + Moray + May = 0. (5.38) 


The two relations in (5.38) imply the absence of the totally symmetric 
tensor S,,, in M,,,. The number of independent components of 
My» is exactly the same as that of M,,,, and can be expressed as 
the linear superposition 


~ 1 
Mur = 2 (Miva + Myx) a5 (Min e5 Mir) ’ (5.39) 


simply corresponding to a change in basis. All these tensors are 
known to be irreducible. It is clear now that constructing higher 
rank irreducible tensor representations algebraically is a formidable 
task. However, such irreducible tensors can be naturally written in a 
diagrammatic form through Young tableaux which we discuss next. 


5.2 Young tableau 


A Young tableau or a Young diagram is a combinatoric diagram which 
appears in various areas of mathematics and physics. It is a collection 
of rows of boxes (with entries in them) left justified and drawn such 
that a lower row never contains more boxes than an upper row. So, 
for example, if a Young tableau (diagram) has @ rows, then it is alge- 
braically designated by (fi, fo,--- , fe) where f; denotes the number 
of boxes in the 7 th row and they satisfy the relation 


pe ts ee fre): (5.40) 


The irreducible representations of covariant tensors for the groups 
U(N) and SU(N) can be naturally written in terms of Young dia- 
grams in the following manner. In this case, we identify the number 
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of rows to coincide with @ = N and identify the entry in a box witha 
tensor index. The tensor structure represented by a Young diagram 
is constructed by first symmetrizing in the indices along a given row 
and then anti-symmetrizing in the indices along a given column. In 
this way, a Young diagram represents a unique tensor representation. 

For example, the simplest Young diagram for U(N) (more gen- 
erally for GL(N) or SL(N)) corresponding to the fundamental rep- 
resentation has only one row with only one box with a vector index 


N-1 
——_—_- 
tu: Lb = (10,0, 94250) =A]; (5.41) 


where the square bracket representation simply (neglects the zero 
elements) assumes that the rest of the (NV — 1) entries vanish cor- 
responding to the fact that the diagram has only one row with one 
box. (Since there is only one index, the question of symmetry or 
anti-symmetry does not arise.) Similarly, turning to irreducible ten- 
sor representations of second rank, we have already seen that there 
are two of them (see (5.17)) and we can represent them respectively 
as 


N-1 
rs ——. 
Sup: Lt v | = (2,0,0,--- ,0) = [2], 
N-2 
Ap: , = (1,1,0,0,--- ,0) =[1,1]. (5.42) 


The symmetry properties of these tensors are manifest. Since Sj, 
is represented by a single row with two boxes with indices 4, v, it is 
symmetric in those indices while A, is denoted by two rows with 
one box in each column (containing indices ju, v), it is anti-symmetric 
in the indices. 

Similarly, we can write the Young diagram for the third rank 
tensor M,,,, as 


N-2 


——\—> 
ee Cm = (2, 1,0, 0, - = iO) = [2, 1], (5.43) 


while that for M Wa has the diagrammatic representation 


N-2 
——_——r 
oe BA = (2,1,0,0,--- ,0) = [2,1], (5.44) 
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As another example, let us consider a tensor T),,«gyr With the 
Young tableau 


| V | N-3 
|a| 6 = (6, 2.2,0:0.052 0): 18,401: (5.45) 
Bak: 


which can be obtained by first symmetrizing all rows (namely, sym- 
metrizing in the indices y,v,A and a, as well as in y,7) and then 
anti-symmetrizing in all columns (namely, in pz, a, y and v, 6,7). Equiv- 
alently, we can first anti-symmetrize in all columns and then sym- 
metrize in all the rows. This can be generalized for any covariant 
tensor. The general Young tableau for a covariant irreducible tensor 
is given by N symbols [f1, fo,--- , fy] satisfying 


Jt fe lg AS fe O: (5.46) 


Note that we do not need more than N symbols which can be seen 
as follows. Consider a totally anti-symmetric tensor Ajjyo.--py41 


with N + 1 indices. The corresponding Young tableau may be de- 
N+1 


scribed as [1,1,- Pout with fi = fo =--- = fui1 = 1. However, 
in the N-dimensional case that we are discussing, the tensor indices 
11, f2,°°* ,n41 can only take N distinct values 1,2,--- ,N. There- 
fore, at least two of the tensor indices must have the same value. On 
the other hand, since the tensor is anti-symmetric in all its indices, 
in this case it must identically vanish, namely, Ay, p.....y4, = 9. The 
same argument also holds for any irreducible tensor with the Young 
tableau [f1, fo,--- , faz] with M > N and fy > 0. 

The dimension of an irreducible tensor representation can also 
be calculated using the Young diagram (for the representation) in the 
following way. First, we introduce the concept of a hook length of a 
given box which is a unique integer associated with a box in a given 
diagram. It simply corresponds to the sum of the number of boxes to 
its right (along the given row), the number of boxes below it (along 
the given column) plus one (corresponding to the box under consid- 
eration). Then the hook length associated with any given diagram 
corresponds simply to the product of all the hook lengths of boxes 
in that diagram. Next, for SU(N), let us label the top most left 
box as N and increase this integer by unity for every box to its right 
and decrease it by unity for every box below it. This gives a unique 
labelling of the diagram by integers and the product of all these inte- 
gers divided by the hook length associated with the diagram gives the 
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dimensionality associated with the tensor representation associated 

with that diagram. For example, following these rules, we can im- 

mediately calculate the dimension of the respresentation associated 

with the mixed symmetry tensors in (5.43) and (5.44) to be 
N(N+1)(N-1) 1 


D(M) = D(M') = eee =e N(N?—-1), (5.47) 


which coincides with (5.34). Similarly, for the general tensor repre- 
sentation in (5.53), the dimension of the representation is obtained 
to be 
N(N +1)(N + 2)(N — 1) N(N — 2)(N — 1) 
5x4x1x3x2x2xl 


= | y(n -1)(N? -1)(N? 4). (5.48) 
240 

As we will point out towards the end of this section and in the 
next section, only covariant tensors are sufficient to describe irre- 
ducible tensors in the case of SU(N) group, while both covariant and 
contravariant tensors are necessary for the U(N) group. Because of 
this we need to generalize the Young symbol [f1, fo,--- , fy] to admit 
negative integer values for some of the f;s. As a simple example, con- 
sider the conjugate representation denoted by a contravariant vector 
x" (see discussion in (5.9)-(5.14)). This has the simple Young dia- 
gram representation 


gle [| =(0,0,--- 0,-1)= [1] Sf (5.49) 


The (—1) here refers to one contravariant index which transforms 
inversely from a covariant index. Similarly, we can verify that the 
traceless tensor T),” (see (5.25)) is irreducible and we can represent 
this as 


N-2 
~ _—~__— ‘ 
T," = (1,0,0,--- ,0,-1) = (J, [-1]) = (A), [1)"). (5.50) 
As another example, let us consider the mixed tensor fies sat- 
isfying 


1 2onaT. (5.51) 


as well as the traceless condition 


N 
» 
Bae Stet (5.52) 
A=1 
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We may represent such a tensor more conveniently as 


N-3 
(c: v |, ) = 2.50 0—1,-1) = (eh tat 


(5.53) 


allowing negative integer values for f;s corresponding to contravari- 
° . VV: DV 
ant indices. More generally, let the mixed tensor T),; lee ‘lp COTTe- 


spond to the Young symbol (f1, fo,--- , fr) for the covariant indices 


(141, M2,°°* sp), While the contravariant indices (v1, 12,--- ,¥q) corre- 
spond to the Young symbol (91, g2,--- , Jm)so that we can represent 
it as 

(fi, fase: tals ois gar »9m)*)- (5.54) 


Written this way, f;, 9; satisfy the conditions 

ii fot eS fa, 

9292 2°°' 29m > 9. (5.55) 
Moreover, we require the mixed tensor to satisfy the traceless condi- 
tion 


AV2Q0Vq __ 
Bs = (5.56) 


for any pair of indices (u;,v,), for j = 1,2,--- ,pandk=1,2,--- ,q 
As we will argue, it is sufficient to consider only the cases satisfying 
n+m < N. Introducing the negative numbers fy—m+1, fy—-m+25°°' > fn 
by 


fu-mt41=—9m, fv-m42=—-9m-1, *°°> fv =—91, (5.57) 
it follows that (see (5.55)) 
OS frome tev 2 he 2s 8 Sins (5.58) 


We designate the tensor to have the generalized Young symbol 


N-n-—m 
——_ 
(fi, fa,-°* ins 0,038" 0; iyi tw), (5.59) 
satisfying 
N-n-—m 


—_—_ 
fi Py See Ty, = OU se 2 ,0> fu—m+1 ees iN (5.60) 
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The reason why we can restrict ourselves to the cases n +m < 
N is that we can reduce the problem by lowering the contravariant 
indices 11, 2,+-- ,¥q by products of Levi-Civita tensors €y,q5..ay tO 
make it a covariant tensor. To give an example, let us consider the 
mixed tensor ieee satisfying (it is anti-symmetric in the covariant as 
well as the contravariant indices) 


mos Aba mos 
Die = yy = 


ub 
ignites (5.61) 


where the repeated index yz is being summed in the second relation. 
This tensor would correspond to the generalized Young symbol (in 
the earlier notation m = n = 2 in this case) 


N-4 
_—~_— 
COCO ees Otel), (5.62) 


It is clear that, for N = 3, this symbol is not of the form (fi, fa, fs) 
and, therefore, it must vanish identically in this case. To see this, 
let us introduce a new lower rank tensor Er. by (repeated indices are 
summed) 


A 
Y 


—aB 


1 
= qeabye Ty (5.63) 


where €;;x denote three dimensional Levi-Civita tensors. Using the 
identity satisfied by the Levi-Civita tensor 


ee apy = 54 (5364 — 6453) — 55(5%5n — 64,62) 
+ 54 (5X53 — 5360), (5.64) 
we obtain from (5.63) (repeated indices are summed) 


PA 1 \ARUVY GPA ALE NAY FRVA OAR AV 
T= 4 (Oe Dy Py + 89 T+ Po — t,) 
= 0, (5.65) 


because of the traceless condition in (5.61). Relation (5.63) is invert- 
ible and it follows from (5.65) that 


Tee = eure T, = 0, (5.66) 


so that this tensor indeed vanishes identically for N = 3. 
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The importance of the generalized (mixed) tensors Tjitj3.-;, tO- 


gether with the generalization of the Young tableau lies in the fact 
that all irreducible representations of the U(N) group can be ob- 
tained in this way, as has been shown by H. Weyl. He also calculates 
the formula for the character dimension of these irreducible represen- 
tations. Let a represent a N x N unitary matrix corresponding to the 
group element a € U(N). Then, the matrix a can be diagonalized by 
another N x N unitrary matrix S so that (we are using a to denote 
the group element as well as its defining matrix representation and 
we note that S € U(N)) 


& 0 0 O 
Oi. Be OT SO. 

Ses =a5= OY 0s ee. 0" ella Sigs En. (5.67) 
0 0 O 


Let V(a) denote a n x n matrix representation of the U(N) group 
corresponding to the group element a € U(N). It follows, therefore, 
that 


Vian) = V(S" 14S) = V(S)V(a)V(S). (5.68) 
Taking the trace of both sides yields 
TrV (ap) = Tr (V(S™')V(a)V(S)) = Tr (V(S)V(S™)V(a)) 
= Tr(E,V(a)) = Tr V(a), (5.69) 


where we have used the cyclicity of the trace. The character of a 
representation V(a) of the group U(N) can now be defined as 


xv (a) = TrV(a) = TrV(ap) = xv(ap), (5.70) 


which is a function of the diagonal elements £1, €,--- ,&y of ap in 
(5.67) satisfying |€;| = 1,7 = 1,2,--- ,N (recall that the matrix a is 
unitary). Namely, we can write 


Xv (a) = xv (£1, 2,°++ ,€w). (5.71) 


We note here parenthetically that for the defining representation, the 
character is simply given by x(a) = & + 2 +---+&y. 

Suppose the representation V(a) is irreducible with the Young 
tableau corresponding to (f1, fo,:-: , fn) satisfying f; > fo >--- > 
fx. Let us define 


&;= fy +(N—3), (5.72) 


5.3 IRREDUCIBLE TENSOR REPRESENTATIONNS OF SU(N) 139 


so that we have 
fy > lg > +++ > by. (5.73) 


Let us define 


K (61, £9, eae aba) = det : : . 5 (5.74) 


(En) (Ey) oe (Ex) 
Then, Weyl’s character formula is defined in terms of this determinant 


as 


KBs 0) 
K((N — 1), (N — 2),---1,0) 


xv (41, f2,°°° sen) = (5.75) 


From this formula, Weyl also determines the dimension of the irre- 
ducible representation to be given by 


N 
Lae ee) 


by noting that the dimension of a representation can be thought of 
as 


Dy (fis fas+ > fn) = TrV(e) = xv(e) = xv(,4,--- 1). (5.77) 


5.3 Irreducible tensor representations of the SU(N) group 


For the SU(N) group we note that the N x N unitary matrix a € 
U(N) satisfies the additional condition deta = 1. In this case, for 
irreducible representations we need not consider the contravariant (or 
mixed) tensors (since the tensor indices can be raised using the Levi- 
Civita tensor leading to an equivalent representation). Moreover, as 
we will show next we can always set fy = 0, if we wish. Therefore, 
irreducible representations of the SU(N) group can be realized by co- 
variant tensors Ty, ;15.--4, With the Young tableau (f1, fo,--- , fy—1,0) 
with f, > feet S fy 0. 

These facts can be understood as follows. Let Ajj, ji... denote a 
totally anti-symmetric covariant tensor with N indices 11, j12,--- , Ln. 
Using the totally anti-symmetric Levi-Civita tensor in N dimensions 
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(with €123.... = 1), we can also write this tensor as (in N dimensions 
any completely anti-symmetric Nth rank tensor is proportional to 
the Levi-Civita tensor) 


Aspen = Eyipo-un P> (5.78) 


for some scalar ¢. Under the action of a € U(N), the tensor trans- 
forms as (repeated indices are summed, see (5.15)) 


/ — 71702... pON 
Ajspapy 7 Au pean = Ui Ua °°? Gx Acras-an 
— ql yO2 1. GON 
= Oy ys An Earar-an P 
= (deta) €uipo-py? = (deta) Apipo-py> (5.79) 


where we have used the definition of the determinant of a matrix in 
terms of the Levi-Civita tensor, namely, 

Qin Oye FUN Eayag-an = (deta) €pipo--py- 
Note that this tensor has the Young tableau notation of [1,1,1,--- , 1] 
(N unit elements). If a € SU(N) so that deta = 1, then it follows 
from (5.79) that 


Ai sstaty >; Aas aia (5.80) 


which describes the trivial representation of SU(N) (namely, ¢ is 
indeed a scalar). As a result, this shows that for SU(N) the repre- 
sentation with the Young diagram [1,1,1,--- ,1] (NV unit elements) 
is equivalent to that with the Young diagram [0,0,0,--- ,0]. More 
generally we can show that any tensor T, with Young tableau (f; + 
A, fa t+A,---, fw +A) for any integer A has the transform given by 
(det a)* times the transform of the tensor with the Young tableau 
(fi, fo,-:: , fv). In particular, for the SU(N) group, the two tensors 
transform exactly in the same way since det a = 1, namely, they are 
equivalent representations. In other words, for the SU(N) group 


(fi +A, fot A,--:, fy +A) = (fi, fa,-:-, fw), (5.81) 


for any integer A. In particular, if we choose A = —fy, this implies 
that we can effectively set fy = 0 as we have mentioned earlier. 

To illustrate this and to indicate how contravariant (or mixed) 
tensors are equivalent to covariant ones, let us consider the group 
SU(3) with N = 3. Let us consider the traceless mixed second rank 
tensor T/’, u,v = 1,2,3 so that it has the Young tableau given by 
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(1,0, —1). If we choose A = 1, then we see from (5.81) that we have 
an equivalent representation given by the Young tableau (2, 1,0). The 
correspondence is indeed established by identifying (repeated indices 
are summed) 


Mir = Epa 5 (5.82) 


satisfying (see (5.33)) 


ron = ee Mir Sin Mia a 1 = 0. (5.83) 


The second relation in (5.83) holds because of the (three dimensional) 
identity 


Ewaly, + €vral + rie Lp: = Gawain. = 0, (5.84) 


since, by assumption, the tensor Ty’ is traceless. The tensor M hie 
as we have seen in (5.44), has the associated Young tableau 


x 
ee ; (5.85) 


Conversely, the second rank mixed tensor T/’ can be expressed in 
terms of the covariant third rank tensor M i , (by inverting the rela- 
tion (5.82)) as 

1 aBysl 

Lie 5 er Ma Bis (5.86) 
where €#”* = Ey numerically. In this way, one can argue that it is 
sufficient to study the representation in terms of covariant tensor in 
the case of SU(N). (Remember that deta 4 1 in the case of U(N) 
and, therefore, this argument does not generalize to U(N).) 

For the group SU(2) similar arguments lead to the conclusion 
that all irreducible representations can be obtained from the totally 
symmetric tensors Sj; y19---u, Where 1, [2,°** , Un = 1,2. The dimen- 
sion of the representation, in this case, follows to correspond to (see 
discussion in the paragraph containing eqs. (5.47)-(5.48)) 

! 

D(n) = ti =n+1, (5.87) 
which can be identified with 27 + 1 for 7 = oe n = 0,1,2,--- repre- 
senting the familiar multiples of half integer values for the angular 
momentum 7. 
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5.4 Product representation and branching rule 


Let U;(a) and U2(a) respectively denote d; x d, and dz x dz irreducible 
matrix representations of the U(N) group. Then, the tensor product 


W (a) = Uj(a) ® U2(a), (5.88) 


also defines a representation of U(N) with dimension d,dz (namely, 
(did) x (didz matrices). That the tensor product defines a represen- 
tation is easily seen from (5.88) 


W (a)W (6) = (U1 (a) ® U2(a)) (U1(b) ® U2(d)) 
= U;(a)U;(b) ® U2(a)U2(b) = U1 (ab) ® U2(ab) 
= W(ab). (5.89) 
The dimensionality follows from 
W(e) — Ux(e) & U2(e) = Fay do — lado; (5.90) 


where Eqyd. = ld,d. represents the d;dz dimensional unit matrix. 
However, the product representation W(a) is not, in general, irre- 
ducible. 

Let us recall that for two independent (covariant) vectors x, and 
yv, their product defines a tensor T),, = ,y, which can be decom- 
posed into the irreducible forms (see discussion following (5.15)) 


Li Se Aas (5.91) 


where S,,, and A,, denote the symmetric and the anti-symmetric 
combinations respectively (see (5.17)). Regarding the product as 
XL, ®& y, for more generalized settings, the discussion carries over 
without any modification. We can assign as before 


Gi) ge Si = fe | 1 | Ay = nell (5.92) 


V 


Dropping the indices u,v, we can write the decomposition in (5.91) 
symbolically as 


| Jel] =| ® (5.93) 


Similarly, if x,,, yv, z, denote three independent vectors, then we have 
already noted in (5.35) that we can express 
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Tw = Spor + Aur + (Murra + Mi) — (Moray + Mpay)- (5-94) 


If we consider the product of vectors as a tensor product x, ® yy ® 2). 
we can again write the decomposition in (5.94) as 


L J@LJ@_J=LL | Je_ |e ® 10:99) 


and so on. 

In general the product of two irreducible representations can be 
decomposed into a direct sum of irreducible representations which 
can be shown by considering their character. For the irreducible 
representations U;(a) and U2(a) of the unitary group U(N), their 
product W(a) = U;(a) ® U2(a) can be written in a suitable basis in 
the fully reducible form 


Wi (a) 0 0 0 
0 Wo(a) 0 vee 0 
S 1W(a)S = 
0 0 W3(a) 0 
0 0 0 . | Wr(a) 


(5.96) 


for irreducible matrices W1(a), W2(a),--- ,W,(a). Taking the trace 
of this relation we obtain 


or, x(a) = 5° x;(a). (5.97) 


Namely, the character of the product representation (a) is simply 
the sum of the characters y;(a) of the irreducible representations 
contained in the representation. On the other hand, we also know 
that we can write the character of the product representation as 


x(a) = TrW(a) = Tr (Ui (a) @ Ua(a)) 
= x(Ui(a))x(U2(a)), (5.98) 
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namely, as a product of the characters of the two irreducible repre- 
sentations. It follows from (5.97) and (5.98) that 


x(a) = x(U1(a))x(W2(a)) = S° x; (a). (5.99) 
j=l 
This also implies that 
Dim (U1 (a)) Dim (U2(a)) = 5" Dim; (a), (5.100) 
j=l 


where Dim; denotes the dimension of the jth irreducible block. Namely, 
there must be a matching of dimensions for the decomposition of a 
product representation into sums of irreducible representations. 

To illustrate this, let us return to the example discussed in 
(5.93). In the present case, it is far easier to compute the characters 
of the individual representations directly rather than using Weyl’s 
formula (5.75). For the purpose of the calculation of characters we 
can choose the basis in which a is diagonal (see (5.67)) 


0 & O =: 
ak 0 0 &3 eh 0 ’ lEal= 1 w=1,2,--- ,N, 
0 0 0 i 
(5.101) 


so that the covariant vectors transform as (repeated indices are not 
summed) 


Tienes “pe 2 ee Ns (5.102) 


Therefore, the symmetric and anti-symmetric products transform as 
(for fixed pu, v) 


(pyr xo CV) =. Cneol Cauy = Loi) (5.103) 
and this leads to 


x ( = eas 


i if (eV 5 
x ( a oe Se oh +S°(&) ; 
p=1 p=1 


HV 
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2 
N 


1 N 
x( ) => k= fu) — Dd (&u)]- (6-104) 
1 


HOV w= yw=l1 


This shows indeed that 


pee hax 1 Saat )+x( i (5.105) 


Moreover, if we set €; = £2 = --- = €y = 1 (see (5.77)), then their 
dimensions can be computed from (5.104) correctly as (they also co- 
incide with the earlier method discussed for calculating dimensions) 


N 
Dim[ ]=S01=N, Dim = 5(N? +N), 


1 
pa lollet 
| | 2 

For the general case, there exists a simple mechanical way of 
decomposing the product of two irreducible representations. For sim- 
plicity, we consider the case of 


(NAN): (5.106) 


| @ : (5.107) 
For this, we label =] a /|and construct the product as 
— _ a | 
® [al er, \@ : (5.108) 


namely, we add [a] to the Young tableau | without violating 


the condition f; > fe >--- > fw > 0 (see (5.46)). Such a rule 
can be obtained by computing the product of their characters and 
decomposing it into a sum of characters of irreducible representations 
(or by matching the dimensions as in (5.100)). For example, 


| @lal= |al@ , 


Sala aicy (5.109) 
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correponding to the decompositions of (see (5.29) and (5.34) as well 
as (5.106), for example, for the dimensions) 


a(n 
x( )x( ea J +x ) (5.110) 


We can, similarly, calculate 


eLlel]=( ® Jen 


and 


= al® SP) “6 5) 
a 


- (5.111) 


to obtain the decomposition given already as the decomposition of 
tensor T),, into its irreducible arguments (see (5.35)). 

This procedure can be generalized to more complicated products 
and we give only a few more examples in the following 


a 
er alj=|[*l#le ; (5.112) 
a 


Note that in the decomposition of the product on the right hand side, 
we do not allow the diagram 


(5.113) 


since, two | a |’s are initially in a totally symmetric configuration of 


a|a |. Similarly for products involving an anti-symmetric diagram, 
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we proceed as follows: we write as ; , and compute 
a a | a 
l= | ah an er: (5.114) 
e b 


Once again, we do not allow the diagram 


a F (5.115) 


since | a | and | b | must be in the anti-symmetric configuration. Fur- 
thermore, we note that if N < 3, then we can omit the last diagram 


= (1,1,1,1), (5.116) 


a 


b 


as it violates the allowed Young tableau symbol (f1, fo, f3) for U(3) 
(namely, there is no nontrivial fourth rank anti-symmetric tensor in 
three dimensions). 

Another subject of importance in physics is the branching rule, 
namely, how does a representation of a given group decompose into 
representations of its subgroups. Let us consider the covariant vector 
Lp, #2 = 1,2,= N which corresponds to the basis of the irreducible 
N-dimensional matrix representation of the U(N) group with the 
Young tableau (1,0,0,--- ,0) with “0” occurring (IV — 1) times. But 

—-__—”’ 
N-1 
if we restrict ourselves to its subgroup U(N — 1), then it is no longer 
irreducible. Indeed 


ej, g=1,2,---,N-], (5.117) 


provides an irreducible (N — 1) dimensional representation of the 
U(N — 1) group, while ¢ = xy remains invariant under U(N — 1). 
We can express this as 


U(N) + U(N — 1), 


(LC), =(D)y. ® (ber, 
or, N=(N-1)61, (5.118) 
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where {1} represents the one-dimensional trivial representation of 
U(N — 1). Similarly, the symmetric tensor of U(N) 


Sw = Sv, v= 1,2,---, N, (5.119) 
decomposes into the symmetric tensor 
Si Sep es 4,k=1,2-* ,N—1, (5.120) 


and the vector 


oj = cet Ns 1? GS ate gino es (5.121) 
as well as the scalar ¢9 = Syy = {1}y_1. In this case, we can write 

U(N) > U(N - 1), 

( | y= ( | i , 2 ( De ,® {1}w-1- (5.122) 


By reducing the characters of U(N) as a sum of characters of 
U(N — 1) group with €, = 1, Weyl has shown that the general case 
is obtained as follows 


Un fae ste. s. BU AG ey 
ret tae Ae 
(5.123) 


GQ), (1) 
2) 


where the summation is over all A, - , fn2, satisfying 


het Spe Sees ashe CID 


Here FY, f©, vee, (1) ) represents the Young diagram of U(N — 1) 
or the first level of reduction. Replacing N by N — 1, we can further 
decompose U(N — 1) into U(.N — 2) etc to obtain that an irreducible 
representation (f1, fo--- , fy) of U(N) can be decomposed to 


U(N) > U(N —- 1) 3 U(N - 2) 5.--- 3 U(I), (5.125) 


by considering the inverted pyramid 
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where f) > ft? > f)) > AP?) >... > fy); forj =0,1,2,--- ,N- 


1 and we have identified fO = fi, 1=1,2,---,N. 

As an example, consider the case of SU(2) group. As we have 
already noted (at the end of section 5.3), in this case, any irreducible 
representation can be obtained from the totally symmetric tensor 
Siz p2...p Corresponding to Young tableau of 


M1 | H2 |: | Be 12 
«+ f boxes > ery) 


with dimension D(f) = f +1 (see (5.87) with f =n). In order to 
apply the branching rule, we embed this representation of SU(2) into 
the U(2) group to correspond to (f,0) with f; = f, fo = 0. Then, 
using the reduction U(2) + U(1), we see from (5.123) that we can 
write (recall that for SU(N) we can always set fy = 0 so that the 
Young tableau would have only (N — 1) nontrivial entries) 


W((f,0))= So eu((f)). (5.127) 


frfO2>0 


Setting f = 23, @ = 0,5,5,3,°°:) with D(f) = f+1=2j4+1 
(see (5.87) and the discussion afterwards), and f) = j +m, (m= 
j,j —1,-++,-J9), this reproduces the familiar angular momentum 
basis vectors |j,m) as 


emi Woy f S253 fs7im, WH Fits bees. 
(5.128) 


In fact, the basis state |j,m) corresponds to a tensor component 


Sag Q2 9 (5.129) 
Se 
jm j—m 
for the totally symmetric tensor Spy ppp Hi, H2,.-.,f¢ = lor 2. 


5.5 Representations of SO(V) groups 


Let us recall the fact that the orthogonal group O(N) is the group of 
all linear transformations (see section 1.2.5 as well as section 2.1) 


N N 
te Saas (= Sata) ; (5.130) 
v=1 v=1 
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which leave the quadratic form 

i taste +23, (5.131) 
invariant. The N x N matrix (a), satisfies, as in (1.67), 

@a= Fy, (5.132) 


where a” denotes the transform matrix of “a”. If we impose the 
additional condition of 


Det a= 1, (5.133) 


on the matrix “a”, then the resulting restricted subgroup is called 
the special orthogonal group SO(JV). If all the a,,’s are real, i.e. 


Ow = Ay, L,v= 1, 2, ae JN, (5.134) 


then the group is written as O(N, R) or SO(N, R) accordingly. How- 
ever, if a,,’s are complex numbers, then the resulting group is de- 
noted as O(N, C) or SO(N, C). We note that O(N, R) and SO(N,R) 
are compact groups, while O(N, C) and SO(N,C) are non-compact. 
Another important group is the non-compact group O(p, q) which is 
the group of all real linear transformations which leave the quadratic 
form 


(ef tag tes +95) — (py to + Bong), (5.135) 
invariant for real coordinates 71, @2,°++ ,Tp+q- 


We will first discuss tensor representations of O(N) and SO(N) 
groups, which contain both SO(N, R) and SO(N, C) as special cases. 
Since a~! = a® (see (5.132)), we see that the contravariant vector a 
transforms as (see (5.9)) 


at — (2!\# =D i(a7") Ha” =D 0(a") ha 


N N 
= =) ge’. (5.136) 
v=1 v=1 


Therefore, it transforms exactly in the same way as the covariant 
the covariant vector x,. As a result, we may identify x“ = x, so 
that there is no distinction between the covariant and contravari- 
ant vectors. Consequently, we need consider only covariant tensors 
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T1419, However, we note for example that the irreducible mixed 
tensor T) iw = Ty» must satisfy the traceless condition 


on, =0= sor = 0, 
A=1 A=1 


nr n 
oS Dp SOS ST (5.137) 
A=1 A=1 


and so on. In general, if T),,,,....,,, denotes an irreducible tensor of 
the O(N) group, we must impose the traceless conditions such as 


n n n 
Sr i = se Dyi2dua-pn = Ds Tyrp2us~-Hn = 9, (5-138) 
d=1 d=1 d=1 


etc for any pair of indices uj and pip, 7 #k of the tensor. 

Let us now restrict ourselves to the SO(N) subgroup. As in the 
discussion of the SU(N) group, the completely anti-symmetric Levi- 
Civita tensor €y; jn, = €14?"" is invariant under SO(N) (recall 
that deta = 1 in this case). Together with the fact that there is no 
distinction between the covariant and the contravariant tensors, we 
can further restrict the irreducible tensors of SO(N) as follows. We 
have now two different cases to discuss, depending upon N being an 
even or an odd integer. Let us consider first the case of N = 2M +1 
to be an odd integer. 

For simplicity, consider the case of M = 2 and N = 5. (The 
case M = 1 and N = 3 has been discussed in detail earlier following 
(5.81).) Note that the totally anti-symmetric tensor A,,,, [,V,\ = 
1,2,--- ,5, has the Young tableau representation 


bb 
y |. (5.139) 
r 


However, A,,,,, is equivalent to another anti-symmetric tensor ¢ag = 
—Ba given by 


5 
1 
Pog — 3 SS Cabin A ur: (5.140) 
“ pwwrA=1 


Conversely, we can express 


5 
1 
Aw = a! S EwwraB Pap: (5.141) 
a,B=1 
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As aresult, we see that we can make the Young tableaux identification 


pL 
ae ; (5.142) 
r 


Correspondingly, we see that we need consider only irreducible ten- 
sors with smaller Young tableau (fi, fo,--- , fz) of at most M entries 
instead of the larger Young tableaux (fi, fo,--- , fy) with N = 2M+41 
entries. 

Next, let us discuss the case of N = 2M being even. For this 
case, using the Levi-Civita tensor, we can also show that we need 
consider only tensors with smaller Young tableaux (f1, fo,--- far) of 
at most M entries. However, we encounter a little complication of the 
following nature. Again for simplicity, we discuss only the simplest 
case of M = 2 and N = 4. (The case M = 1 and N = 2 has been 
discussed at length earlier in this chapter.) If Guy = —Grp, wv = 
1, 2,3, 4, is an anti-symmetric tensor with the Young tableaux 


" (5.143) 


then we can define another antisymmetric tensor 


4 
a 1 
Gu = on Ss" EwwapsGap, (5.144) 
a,B=1 


with the same Young tableaux. However, a difficulty arises in that 
Gv and *G,, are not irreducible under SO(4). For example, we note 
that if we define 


FE) = 5 (Gav — Orig ls (5.145) 


then it satisfies 


4 
* D(+ 1 se ah 
FD) = 5 >) ewapkag = +E). (5.146) 
; a,B=1 


The tensors F; (+) are called selfdual or anti-selfdual tensors of SO(4) 
and are irreducible tensors of SO(4). In comparison to G,,, which 


has 6-independent components, Fr) have only 3-independent com- 
ponents each. This is slightly more familiar from electromagnetism 
where the relevant group is SO(3,1) (the Lorentz group) instead of 
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the SO(4) group being discussed here. In conclusion, we note that for 
the case of N = 2M, if the tensor T),,...,,, has the Young tableau of 
form (fi, fo,-++ fir) with fy, > 0, then we must impose similar duality 
restrictions on the tensor in order to determine its irreducibility. 

So far, we have discussed the groups O(N) are SO(N) and with- 
out making any distinction between SO(N, R) are SO(N,C). For the 
case of SO(N, R), if Ty ps.--y, iS an irreducible tensor and transform 
as 


N 
Tya-pn Dies = > Orv Gpave *** UpnvnLry-vns (5-147) 
V{V2-"*Un=l 
then its complex conjugate T* transforms exactly in the same 


Hi bain 
way as Ty,,...u, (Since a,, are real in this case). In other words, 


they give the same equivalent irreducible representation of SO(N, R). 
However, in the case of SO(N,C), this is not case and the two give 
inequivalent representations. Namely, if U(a) is a n x n irreducible 
matrix representation of SO(N, C), then its complex conjugate 


W(a)=U*(a) (equivalently U(a*)), (5.148) 


gives an inequivalent irreducible representation of the same dimen- 
sion, called the complex conjugate representation. We will not go 
into any further discussion on this topic. 

It may be instructive to discuss briefly the case of SO(3,R), 
where we need consider only the completely symmetric tensor Sj; y19---un 
with 41, la--: , ln = 1, 2,3 satisfying the traceless condition 


3 
> Sr uate = 0, (5.149) 
A=1 


for n > 2. The dimension of such a representation (the number of 
independent components of such a tensor) is exactly computed to be 


D=2n+1. (5.150) 


3 
For example, for n = 2, the tensor S,,, = Sy, satisfying 5° S,,, = 0 
p=1 
has (5 x3x4—1=5=2x2+1) 5 independent components. Com- 
paring (5.150) with the dimensions of representations of the standard 
angular momentum algebra of quantum mechanics (D = 27 + 1), we 
see that n = 7 = 0,1,2,3,--- , i.e. these tensors realize all integer 
angular momentum states, but not the cases of half-integer states 


154 5 IRREDUCIBLE TENSOR REPRESENTATIONS AND YOUNG TABLEAU 


5,3,+++, in contrast to the case of SU(2) discussed already. 


Properly speaking, the half-integer representations are not represen- 
tations of SO(3), rather they are the representations of the covering 
group spin(3), since SO(3) is not simply connected. Nevertheless, 
in view of their importance in physics, we will discuss briefly the 
so-called double-valued spinor representations of SO(N) group (for 
N = 3) in the following section. 


with j = 


5.6 Double valued representation of SO(3) 


Let us recall that a general rotation of the three dimensional (Eu- 
clidean) vector can be written as 


/ 


x x x 
yl oly | =R64v) ly], (5.151) 
Zz z! Zz 


where the (3 x 3) rotation matrix R(0, ¢,~) can be parameterized in 
terms of the (three) Euler’s angles ~,¢@ and ~ as 


RO, 9, Y) = 


csocsy — csOsnodsny —sndcsy —csécsdsny  snésnw 
—csOsny + csOsndcsy —sndsny+csécsdcsy —sndcsw 
snOsnod snOcsh cs0 
(5.152) 


Here we have abbreviated “cosine” by “cs” and “sine” by “sn” for 
simplicity. Furthrmore, we note that the ranges of the Euler angles 
are given by0 <0<7,0<¢@< 27,0 < w < 27. The rotation matrix 
in (5.152) can also be written as a product simpler matrices 


RO, 6, %) = Rap) (0) R3(9), (5.153) 


where R,, Rg denote respectively rotations around the «, z axes, namely, 


1 0 0 
Ri(0)=]0 cos? —siné], 
0 sin@ cosé 


cos@¢ —sing 0 


R3(¢) = | sind cosd@ OO]. (5.154) 
0 0 1 
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Considering infinitesimal rotations, we have the generators of 
the three rotations around the x,y, z axes (or the basis elements of 
the Lie algebra) given by 


0 0 O 0 0 7% 
J, = 0 0 at, Jz = 0 0 0 5 
0 -—27 0 —7 0 0 
0 2 0 
Jz =|-i 0 O], (5.155) 
0 0 0 


and they can be easily checked to satisfy the Lie algebra of so(3), 
namely, 


[soa | = Vege hs - “ek HAG: (5.156) 
It is now straightforward to verify that we can write 

R,(0) =e, Rg (o) = e'?8, (5.157) 
so that we can write (see (5.153) and (5.157)) 

RO, b, ) = et? 73 e091 cies, (5.158) 


It is worth noting here that this discussion has been within the con- 
text of the defining (3 x 3) matrices. For a general representation of 
SO(3), we can replace the generators J;, J2, J3 by the corresponding 
(27 + 1) x (27 + 1) dimensional matrices. 

Let us next note that for rotations around the z-axis 


R3(¢) = 8, (5.159) 


since ¢ and ¢+ 27 realize the same rotation around the z-axis, we 
must have 


R3(o+2m) = Rad), or, 28 = 1, (5.160) 


which can be satisfied only for integer values of the angular momen- 
tum eigenvalues 7. We can verify that (5.160) holds for the 3 x 3 
matrix representation given in (5.155). However, for the two dimen- 
sional (spinor) representation of the Lie algebra, we have 


1 
Ke= 50, 1=1,2,38, (5.161) 
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where o; denote the three Pauli matrices. In this case we have 


R3(¢) = en 073 — (cos 5) 1+% (sin 5 03 


ex? 0 
_ ; Ay |e (5.162) 
e€ 


In this case, it is clear that 


R3(o+ 2m) = —Ra(), Rao + 4) = Ra(9), (5.163) 


namely, it represents a double valued representation. 
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CHAPTER 6 


Clifford algebra 


6.1 Clifford algebra 


We start with the defining relation for a Clifford algebra (named 
after the British mathematician William Kingdon Clifford) which is 
generated by matrices 71,72°-: , Yn Satisfying the algebraic relation 


VW + Wp = 26uv1, (6.1) 
with w,v = 1,2,---,N. Setting u = v in (6.1), we note that 
=I, (6.2) 


for any fixed uw = 1,2,---,N. Also, if uw 4 v, it follows from (6.1) 
that 


WY =~ ps (6.3) 


so that y, and y with uw ~ v anti-commute with each other. The 
matrices 7,, are known as the generators of the Clifford algebra (6.1). 
The study of Clifford algebra is important in physics in connection 
with Dirac equation which we will discuss in the next chapter. 


6.1.1 Dimension of the representation. The matrix dimensions of the 
generators 7, (and, therefore, a representation of the Clifford algebra) 
can be determined from the relations (6.1)-(6.3) as follows. In this 
section we will restrict ourselves to irreducible representations and 
reducible representations will be discussed in the next section. For the 
irreducible representations, there are at least two ways of determining 
the dimension and here we will discuss one of the methods. The 
second and more direct determination will be carried out in the next 
section. 


157 
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Let us assume that the matrices y, provide an irreducible repre- 
sentation of the Clifford algebra (6.1). From the matrices 1 and ¥,, 
we can construct the set of matrices 


Ly Yer WW (HS), Ya (U<v<a), ++, 19298 YN (6.4) 


Any higher order product of the y, matrices clearly reduces to one 
of these structures because of (6.2) and (6.3). The number of such 
matrices in each category is easily calculated to be respectively (the 
binomial coefficients) 


1,N, NW 1), SN(N -1)(N —2),---, 1, (6.5) 


3! 


so that the total number of such matrices in the set (6.4) is given by 


1+N4 L N(N 1) + aN(N L)(N= 2) aS"). (66) 


We can easily verify that the matrices in (6.4) are linearly inde- 
pendent in the following manner. Let us denote the set of matrices 
collectively as P4 with A =0,1,2,--- ,2% —1 and note that 


(Ta)? = eal, (6.7) 


with ¢4 = +1 for any A = 0,1,2,--- ,2% — 1. For example, we see 
that for p Av 


(ww)? = Wy = Wl Ip) Ww =-l, (6.8) 


Equation (6.7) implies that Tj’ exists and can be identified with 
(recall that €?, = 1) 


(eee aye (6.9) 


Also, for any two Ty and Ip, there exists '¢ in the set of matrices 
satisfying 


Tale =easlc, (6.10) 


with €4g = +1. Moreover, [¢ = 1 is possible only for [4 =Tg. For 
example, we note that if 14 = 7, and lg = yy with p< v, then 


Ys if A=yp 
—y,, if A=y»y, 
Palp=pww= 4 Ww if A<u<y, . (6.11) 
— WI Wt w<rA<y, 
Wwya, ib w<v<da. 
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Further, for a fixed Ig, a set consisting of all [41g covers the original 
set of matrices in (6.4) [4 = 1,9, %Ws-+: »V17273°°* Yn» Up to 
signs. 

To proceed we have to now make a distinction between the cases 
of N = 2M (even) and N = 2M +1 (odd) for the following reason. 
Let us denote 


A= 17273 °** Yn: (6.12) 
It now follows from (6.3) that 


A if N =2M + 1(odd 
‘= Yr i + 1(odd), (6.13) 
—Ay,, if N =2M(even) 
In the case of N = 2M +1, then, this implies that 
[A,T'4] = 0, (6.14) 


for any [’y in the set (6.4). Moreover, we have that 
1. Trl’, =0, unless Ty = 1 for the case if N = 2M (even), 


2. TrI', =0, unless Ty, = 1 or 4 = A for the case of N = 2M+1 
(odd). 


In order to show this, we note that for any fixed [4 such that '4 4 1 
for N even, and [4 4 1,A for N odd, we can always find some Ig 
satisfying 


T4=—Tpla4ry', or, Pala =—lala. (6.15) 


This can be checked directly, for example, if 4 = y,, we can choose 
Tg = yw where v ¥ pw and they will anti-commute. (Similarly, if 
TA = WW, # VY, we can choose [gp = y, ory, and the two will 
anticommute and so on.) Taking the trace of both sides in (6.15), we 
obtain 


TrP4 = —Tr (Uplaly') = -Tr (Pal gp Ts) 
STP al) = Tris =O; (6.16) 


where we have used the cyclicity under trace. This leads to the 
fact that I4’s are linearly independent for the case of N even. For 
example, suppose that there exist constants C4 satisfying 


QN-1 
S$ Cala =0, (6.17) 
A=0 
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then, multiplying with an arbitrary [gp and taking the trace of both 
sides, we obtain 
2N_-1 
S> CaTr (Tag) = 0. (6.18) 
A=0 
On the other hand, when N = 2M (even), we note that Tr (C4l'g) = 
Tr (eapl'c) = 0 (see from (6.10) and (6.16)) unless [¢ = 1 which is 
possible only if [4 =Ig. Therefore, in this case we can write 


oN = NT 
S° CaTr(Tul's) = S> Cadaz = Cp = 0, (6.19) 
A=0 A=0 


for any B. As a result, the constant coefficients in (6.17) cannot be 
nontrivial which proves the linear independence of the set of matrices 
T'4 for N = 2M (even). Since we have 2” linearly independent 
matrices I'4, the Burnside theorem implies that I'4’s must be 2” x 2™ 
matrices if N = 2M (even) and if [y’s are irreducible. However, we 
will prove the same result more directly in the next section. 

For the case of N = 2M + 1 (odd), we have already noted in 
(6.14) that 


ae Ree Oe (6.20) 
for all I'4 in the set (6.4). If the set of matrices [y is irreducible, 
then by Schur’s lemma we can identify 

A = Com4il. (6.21) 


Using the anti-commutativity of the matrices 7, (see (6.3)), it follows 
that 


eS (aor ee = (-1)MOM+) 1, (6.22) 
so that we determine (from (6.21)) 


(2D ¥eMr), MOM*)) 


or, Com+41 = (— ) (6.23) 


Com+1 — 


As a result, Cou. = £1 or Coy, = +V—1 depending on the value of 
M. Using (6.21) as well as (6.2), it follows now that 


A= VWY2 9° Yous = Comsil, 


Or, Yam+1 = cy y172 71+ Vom. (6.24) 


Namely, when N = 2M + 1 (odd), the algebra reduces to that 
of an even dimensional (NV = 2M) Clifford algebra generated by 
V1, 72s°** + Yom. Asa result, the matrices y, (I'4) are now realized as 
2” x 2™ matrices for an irreducible realization. 
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6.1.2 Reducible representation. Let us next show that a reducible rep- 
resentation of the Clifford algebra is fully reducible. Let us assume 
that 7, denotes a reducible representation of the Clifford algebra (6.1) 
of the lower triangular form (see section 2.2) 


(1) 
Yu 0 
Yu = 5 (6.25) 
« W 


where >) and 42?) denote irreducible representations of (6.1). Namely, 


they satisfy 
Wa) +P = 26,1, 
Pa +9P4P = 251, (6.20) 


(The off-diagonal elements also have to satisfy a relation such as 
(2.94), with N’ identified with N, in order that y,, denotes a repre- 
sentation of (6.1). However, this is not important for our discussion.) 
We can now construct the set of matrices [4 using 1 and y, which 
will all be lower triangular. Let us next define a block diagonal (fully 
reducible) representation of (6.1) of the form 


(1) 
= Y 0 
qi, = ( li >) (6.27) 


so that we can construct a second set of matrices 4 from 1 and ar 
which will all be block diagonal. 
Let us now construct the matrix 


Qn_1 


s=)- (Fa) Ta. (6.28) 


A=0 
It follows now that, for any fixed B, 
N-] 


(fs) STs = Ss (Fabs) (C40). (6.29) 
A=0 


On the other hand, we know (see (6.10)) that 
Talp=easlc, Pals =easlc, (6.30) 


for the same €4g and the same C (depending on A for a fixed B). 
As a result, it follows from (6.29) and (6.30) that 
N_] 


Gi) sige Ge) tee. (631) 


C=0 
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_ Since the matrices [4 are lower triangular and the matrices 
('4)~t = eal’, are block diagonal, by a direct straightforward cal- 
culation we can determine from (6.28) that 


QN (1) 0 
s=( Noone}? (6.32) 


for some rectangular matrix N whose exact form is not important. 
It follows now that 


det S £0, (6.33) 
so that the inverse $~! exists and we obtain from (6.31) that 
Tp =STzS"t, (6.34) 


for all B. This proves the full reducibility of the representation since 
we can always find a similarity matrix S which will bring +, in (6.25) 
into the block diagonal form (see section 2.2). A similar argument can 
be carried through if we choose the matrix y, in (6.25) to be upper 
triangular instead. This proves that any reducible representation of 
the Clifford algebra is fully reducible. 

We will now describe an alternate and direct way of calculating 
the dimension of an irreducible representation. As is clear from the 
above discussion, this is sufficient to determine the dimension of any 
reducible representation (since they are fully reducible). Let us as- 
sume that the generators y, are d x d irreducible matrices. For an 
arbitrary d x d matrix Y, let us define 


ON] 


Sa So Wa VEx (6.35) 
A=0 


It now follows that for a fixed B 


2N_-1 
((s) Sle = 5° (Cala) *Y (Cal's) 
A=0 
2N—1 
=o) Me = 6, 
C=0 
or: Sls =e, (6.36) 


where we have used [4I'p = east in the intermediate step. Since 
S commutes with all the matrices in the set, by Schur’s lemma we 
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conclude that S' must be proportional to the identity matrix so that 
we can write 


2N—1 
S= So (fa) VYTa=Al, 
A=0 
QN 1 
or, TrS= S- TrY = 2d, 
A=0 
ON 
or, AS a Try. (6.37) 
As a result, the matrix S has the form 
2N_1 oN 
SS 50a) whee (= mY) 1. (6.38) 
A=0 : 


Taking the (j,m) matrix element of both sides in (6.38) we ob- 
tain (summation over the repeated indices k,/ is understood) 


2N 1 oN oN 
> (Ca)je Yet (CA) im = aE OjimY kk = a OjmOKLY kl 
A=0 
QN-1 oN 

or, S> (Ta)je CA)im = a OjmOkls (6.39) 
A=0 


where 7,k,l,m = 1,2,---,d and we have used the fact that Y is an 
arbitrary d x d matrix. If we set 7 = k,l = m and sum over these 
two indices, we obtain 


QN_] 
So (ba) eS 2”. (6.40) 
A=0 


There are two cases to consider, namely, N even and N odd. For 
N = 2M (even), we know that TrI.4 = 0 except for [9 = 1 for which 
Trl'o = Tr (fo) | =d. Therefore, in this case (6.40) leads to 


d2=2N=2°M@) or, d=2™, (6.41) 


which coincides with our earlier result (see discussion after (6.18)). 
For the case of N = 2M +1, we have TrI’y = 0 unless [4 = 1 
or [4 = A and in each of the two cases Tr [4 « d (see (6.24)). 
Consequently, (6.40) leads to 


= ON = PM or, d= 2”™, (6.42) 


as we have already seen (see discussions after (6.24)). 
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6.1.3 Irreducible representation and its uniqueness. We will next show 
that an irreducible representation of the Clifford algebra is unique 
for the case of N = 2M (even), while there exist two inequivalent 
irreducible representations of the same dimensionality for the case of 
N = 2M-+1 (odd). Let 7, and ¥,, be two inequivalent representations 
of the Clifford algebra of dimensionlity d. We can construct two sets 
of matrices [4 and I'4 from (1,7) and (1,7,) respectively. Then, 
for any arbitrary d x d matrix Y, let us define 


QN_-] a 
ep eae) ma ae (6.43) 
A=0 


As before (see (6.36)), we can show that, for an arbitrary fixed B, 
the matrix S satisfies 


(Cp) Sl g=S, or, STg=TzS. (6.44) 


If pg and i B denote inequivalent representations, then we must have 
S = 0 identically by the second Schur’s lemma so that (repeated 
indices k,] are summed) 


2N_1 = 
yay aS, 
A=0 
2N_1 
aff = = 
or, > ade (Ba), Yar = 0, 
A=0 
QN 1 
=. f= - 
ae Sn; (Ta) =e (6.45) 
A=0 
for arbitrary j,k,l,m = 1,2,---,d since the elements Y;; are arbi- 


trary. In particular, choosing j = k and 1 = m in (6.45) and summing 
over these indices we obtain 


oN =T 2 
S° Tr (Fa)? Trl, =0. (6.46) 
A=0 


For the case N = 2M, only To = iy = Il give a nonzero contribution 
Try = Try = dd,o, so that (6.46) leads to 


d? =0, (6.47) 
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which is not compatible for a nontrivial d. Therefore, this proves 
that the Clifford algebra with an even N admits only one (unique) 
irreducible representation. However, in the case of N = 2M +1, both 
To = 1 and ['y5n_, = A lead to nontrivial traces. Using (6.21), we 


note that we can write A = cl,A = 1 so that (6.46), in this case, 
leads to 


d +c léd? =0, 


Or, “6 =]—=6, (6.48) 


which also implies that 


eae (6.49) 


This shows that, for N = 2M + 1 (odd), inequivalent representations 
of the Clifford algebra with the same dimensionality are possible. 

In fact, we note that given an irreducible representation y,,, we 
can find another irreducible representation ¥, simply given by 


A = —w (6.50) 


which, in odd dimensions (N = 2M + 1) indeed leads to 


A= 9192°**Yome1 = (—1)?"1 


172° Yama = —A. (6.51) 
That the two representations are irreducible can be seen simply from 
the following argument. Suppose that there exists a nonsingular d x 
d matrix (similarity transformation) S satisfying (relating the two 
representations) 


ens ao: (6.52) 


then, this would imply that 


1 "172 ee Yoma1 = So ly ie “Yom419) 
or, A=S AS =cS 1S =cl=A, (6.53) 


which would contradict (6.51). This shows that there cannot be a sim- 
ilarity transformation connecting the two representations and, there- 
fore, they are indeed inequivalent. However, there cannot be a third 
inequivalent representation beyond these two, which can be seen as 
follows. Suppose there exists another irreducible representation 7,, 
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so that (Yes Yar Yu) define three inequivalent representations of the 
Clifford algebra. In this case, applying the analysis of (6.48)-(6.49) 
pairwise, we obtain the relations 


A=-A, A=-A, A=-A, (6.54) 


which are clearly inconsistent. 

In conclusion, we see that N = 2M (even) admits only a unique 
irreducible representation of dimensionality 2” while there exist two 
inequivalent representations with the same dimensionality 2” for 
N = 2M -+1 (odd). In the latter case, the two representations may be 
simply related as ¥,, = —7,. For example, let us consider the case of 
M =1. For N = 2M = 2, we note that the three 2 x 2 (d = 2' = 2) 
Pauli matrices 


_(o1 _ (0 -i (i 0 cs 
o1= 1-8 s O25 Fi 0 ’ 03> (<1 ’ ‘ 


satisfy the Clifford algebra (y, = oy, = 1, 2,3) 
OpOp + OVC y= 2p 1, (6.56) 


and provide a unique (2-dimensional irreducible) representation. How- 
ever, for N = 2M + 1 = 3, in addition to the Pauli matrices (which, 
of course, provide a 2-dimensional irreducible representation), a sec- 
ond inequivalent representation of the Clifford algebra is given by 
o, = —o, and we note that 


A = 010903 =i, A = 616003 =-il= —A. (6.57) 


6.2 Charge conjugation 


Let us next discuss the charge conjugation matrix C' associated with 
a Clifford algebra. This is a rather important concept in relativis- 
tic quantum field theories involving fermions and is defined by the 
relation 


Cie = —, or, Cyrc ==, (6.58) 


where 7 denotes the transpose of the matrix +, (generators of the 
Clifford algebra). Since y, satisfies the Clifford algebra (6.1), it fol- 
lows that 7, = — = C~'y,C also satisfies the Clifford algebra 


WW + WV = 26yry1. (6.59) 
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Therefore, the charge conjugation matrix C can be thought of as 
a similarity matrix between the two representations. To study the 
properties of this matrix, we note that by taking the transpose of the 
definition in (6.58) we obtain 


zs T = = 23 
Yw=— (CC) = -OF yi (CN)? = CPC, CC)", 
or. 41 yy (ror : 
on (CPC yd. (6.60) 
It follows from this that C’C~! commutes with the entire set of ma- 
trices [4 and, therefore, must be proportional to the identity matrix. 
Therefore, writing 


o'¢o = Al, (6.61) 


where 7) is a constant scalar parameter, we obtain 


Ct = 1C, 
or, C=C! =7°C, 
Oe 4 (6.62) 
so that 7 = +1. 


We can determine the value of 7 as follows. Let us consider 
(6.39) and write it in the form 


oN ST oN 
S- (Tale (4) ml qe OjmOKl- (6.63) 
A=0 


Multiplying this with (ern OF and summing over the repeated in- 
dices, (6.63) leads to 


ON 1 oN oN 
A=0 


where we have used (6.62). We note that a [4 involves a product of 
the matrices y, of the form YyjYuo-** Yue With wi < w2 <--: < pe. 
For a fixed £, there will be esa such matrices in ’4. For every such 
I’, with a fixed @, we have (see (6.1) as well as the second relation in 


168 6 CLIFFORD ALGEBRA 


(6.58)) 
TpA-1 Tpr-1pn.T peA-l T A-1A,T p-l 
CVA S09, CO fC CU. Cae 
= (-1) Vue Vue1 °° Yue Vr 
’ e(€—1) 
SSS l) 2 aan tv ae 


=(-1) 


Using this in (6.64) and replacing the sum over A by a sum over 4, 
we obtain 


e(£+1) 
2 


Ta. (6.65) 


1 N 


~ 7 A 2 
d, (AICTAC™) 5, = 155m 
A=0 
N 
N! (e+) an 
Nom CD * bin = “pin 
e=0 
N 
d N! e(e+1) 
or, "= 5 iW oid | 1 er ee (6.66) 


To proceed further let us consider the case of even N. We recall 
the identities 


L=0 
> 
N! ‘is 
=?) amyiamyl 
N 
N!\ 
N _ oe Vin L e 
(1+a)% —(1-a) ~ wpa (* (-a)') 
IN 2 
sy8 : Ni! 2m+1 
~ £4 (N=2m-DiIQm+ Ii 


(6.67) 


Decomposing the sum in (6.66) into even and odd values of ¢ and 
using (6.41) as well as (6.67) we obtain (recall that d = 22 for even 
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N, see (6.41)) 


N 
2 
n= d N! ( 1) e Cnet) 
QN iF (N — 2m)!(2m)! 
N-1 
_@ N! (1) 2m+1(m+1) 
| aN < (N— 2m — 1m + 1)! 


eer [( + V=1)% + (1 - V-1)) 
+V-1((1+ V-1)% - (1- V-1%)] 


- an (era at et (heyy 
= ap [RY ae) 
= v2 cos 7(N +1). (6.68) 


Here we have used \/—1 = 7 in the intermediate step, but the expres- 
sion is symmetric in 7 + —7. For even N, therefore, we obtain 


v={ (-1)™ for N =4m, 


(-1)™*1 for N=4m+2. fe? 


The case of odd N, however, is a bit more complicated. Since 
there now exist two inequivalent irreducible representations y,, and 
Yu, there is no guarantee that =o) would belong to the original ir- 
reducible space of y,. If it does, then there will still be a charge 
conjugation matrix C satisfying (6.58), namely, 


Cty C= —y/). (6.70) 


On the other hand, if —y belongs to the second irreducible repre- 
sentation, the charge conjugation matrix has to satisfy instead (see 
(6.50)) 


CAC = = 
or; Cl yjC= ane (6.71) 
Repeating the earlier analysis leads to 
N=4m4+1: CoyC = one GT = (-1)"C, 
N=4m+3: Cy. = -¥, Cha epene 
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As an example, let us consider the case of m = 0 and N = 4m+3 = 3. 
In this case, as we have already seen (see (6.55)) we can identify 7, 
with the three 2 x 2 Pauli matrices y,, = o,. In this case, we obtain 


which satisfies (6.72) (the second relation). 

Let us conclude this section with the following observation. Let 
us consider oi which is the Hermitian conjugate of y,. Taking the 
Hermitian conjugate of (6.1), we see that yh, also satisfies the Clifford 
algebra 


ahah + Wah = 260 1. (6.74) 


Therefore, for N = 2M (even), there must exist a similarity trans- 
formation T relating the two 


GM ime ie (6.75) 


We know that we can always choose y,s to be Hermitian with a 
suitable choice of basis, namely, 


928 (6.76) 


for zp = 1,2,---,N (this holds even for odd N which we will not go 
into). For example, for N = 2M we can write 


O oy 
= ? 6.77 
Yu (" 0 ( ) 


with 2/—~1 x 2“—1 matrices o,, (not the Pauli matrices) satisfying 


'=olo,t+ol 


LL in von — 20uy 1, (6.78) 


oo} + 0,0 


so that (6.1) holds. For example, for the case M = 2, N = 4, we can 
identify the 2 x 2 matrices o,, with the three Pauli matrices as well as 
the identity matrix as o,, = (01,02, 03,71) and oh, = (01, 02,03, —i1) 
which satisfy (6.78) and lead to a Hermitian representation for 7,,. 
With this choice, we have 


1 0 
A= 17273874(= 95) = ( 5) (6.79) 


which is quite useful in the study of the Dirac equation. 
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6.3 Clifford algebra and the O(N) group 


Clifford algebras have an intimate relation with the representations 
of the O(N) group which can be seen in the following way. Let 
a € O(N)denote the N x N orthogonal matrix (in the defining rep- 
resentation) satisfying (in chapters 2 and 4 we have distinguished 
between a group element a from the corresponding defining matrix 
A by denoting a + A, here we use a to stand simultaneously for 
both the group element and its fundamental matrix representation 
to avoid proliferation of symbols) 


a a= aa? = 1. (6.80) 


Let y,, 4 = 1,2,--- , N represent the generators of a Clifford algebra 
satisfying (6.1). Then writing 


N 
Vp =a SS ayy Vv» (6.81) 
v=1 
we determine 
N 
Wy + Wp = Ss" AprAvp (YrYp + YeVa) 
A,p=1 
N 
= » Grup (25,1) 
A,p=1 
N N 
=2 Ss" AyAyy1 =2 S- Ay ,a4,1 
r=1 rA=1 
=2(aa") 1 = BI, (6.82) 


so that 7, also satisfy the Clifford algebra. Here we have used (6.80) 
in the last step. 

Let us assume that N = 2M. In this case, we know that the 
irreducible representation of the Clifford algebra is unique. Therefore, 
(6.82) implies that there exists a similarity transformation S(a) € 
O(N) 


N 
S~*(a)y,S(a) = YW = > Quy Vv» (6.83) 
v=1 


which transforms between equivalent irreducible representations of 
the Clifford algebra. (S(a) denote 2” x 2” matrices.) For the case 
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of N = 2M +1, on the other hand, this may not necessarily be correct 
since there exist two inequivalent irreducible representations. Let us 
suppose that ¥, belongs to the second inequivalent representation 


(—7,), namely, 


N 
S-*(a)(—Yw)S(@) = Fp = So uw, 
v=1 


N 
or, S~'(a)y,S(a) = — Ss" Gin Ve: (6.84) 
v=1 


In this case (odd N), we recall (see (6.24)) that 


A — 172 eee Yn 
1 N 
~ WI » Epi pon Yer V2 °° * Yun (6.85) 
* pape-enN=l 


which follows because of the anti-commutativity of 7s (see (6.3)) as 
well as the antisymmetry of the Levi-Civita tensor €,;,5.~,y. Fur- 
thermore, it follows from (6.84) and (6.85) that 


N 
= 1 
S~*(a)AS(a) = NI os Epa pon 


* pa pay N= 


x (9° 44 8)(S° "Yu55) ac (So Vig) 


()% < 
~ WI > Cui pn Opi °° * Sunun 
“P1257 LN, 
4,02," YN=1 
x (YW) 
N 
(det a) 
=F NI S- Evy v2--un Yi Yv2 °° * Yun 
“yy vay YN=1 
= —(deta) A, (6.86) 


where we have used the fact that N = 2M +1 as well as the definition 
of the determinant of a matrix, namely, 


N 


> Chie Oma @ Oyu — (dela | enuengs (6.87) 
His N= 
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On the other hand, we have seen in (6.24) that in odd dimensions 
A =c1. Using this in the left hand side of (6.86) we conclude that 


cS 11S =cl =A = -—(deta)A, 
or, deta=-—l. (6.88) 


Therefore, as long as we restrict ourselves to the special orthogonal 
eroup of transformations SO(N) satisfying a’a = 1 and deta = 1, 
we must have 


N 
S7\(a)yS(a) = >) aww, (6.89) 


even for the case N = 2M +1. We note here that the mirror reflec- 
tion (or parity) transformation is an example of a € O(N) satisfying 
deta = —1. This analysis shows that for the case of even N we can 
consider any a € O(N) while for odd N we will choose a € SO(N) 
for the rest of this discussion. 


Let a and 6b be two such matrices. Then, we can write (see 
(6.83)) 


N 


Son -> Quy v5 “1 (b) YS (b) = SS; buys (6.90) 


v=1 


which leads to 


Sb) (S(a)y5( = Yow (s ~*(b)wS(0)) 


3 Aubry pIp = 3 (6 Op ye 


v,p=1 p=1 
N 


S (ab) nop = S~"(ab)yS(ab), (6.91) 
p=1 


where ab denotes the matrix product of the two N x N matrices a and 
b (in the first term in the last step) so that ab € O(N) and satisfies 
(for the case of odd N) det(ab) = 1 if deta = detb = 1. It follows 
from (6.91) that 


S(ab)S~"(b)S~" (a) yp = YwS(ab)S~"(b)S~*(a), (6.92) 
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and since S(ab)S~+(b)S~!(a) commutes with all the generators 7, 
(and, therefore, with the set 4), by Schur’s lemma we have 


S(ab)S—1(b)S-1(a) = AL wy}, (6.93) 


for some parameter \ = A(a,b) in general depending on the group 
elements a,b. (Here [N/2] stands for the integer part of N/2.) Asa 
result, we can write 


S(ab) = d(a, b)$(a)$(b). (6.94) 


This implies that S(a) defines a projective (ray) representation (see 
section 4.4) of O(N) for N even and of SO(N) for N odd. 
We note that a relation of the form (6.90) 


N 
S~*(a)y5(a) = S- Quy v5 (6.95) 
v=1 


is invariant under S(a) > cS(a) so that with a proper choice of c, we 
can always assume 


det S(a) = 1, (6.96) 


for the transformation matrices S(a),.5(b), S(ab). Using this in (6.94) 
we obtain 


N 
det S(ab) = (A(a,))? 7’ det S(a) det 5(b), 
or, A(a,b) = e2mtk/(Z) (6.97) 


If we restrict ourselves to the case of SO(N) for even N, we can 
assume A(a, b) = 1 because of the following argument. For a = Ly, we 
can choose S(a) = $(1) = 1,n;2 since this corresponds to the trivial 
transformation. Therefore, starting with the point a = b = 1y, we 
can set A(1y,1y)) = 1. Since S(a) is supposed to be a continuous 
function of the group element a, as long as any other point on the 
group manifold can be reached continuously from the starting point 
(origin) a = ly, it follows from the argument of continuity that 
(a,b) = 1. This is true, in particular, for the group SO(N). On the 
other hand, the orthogonal group O(N) contains a disjoint mirror 
reflection symmetry which canot be reached continuously from the 
origin. In this case, for A(a,b) = 1, we have 


S(ab) = S(a)S(b), (6.98) 
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and S(a) realizes the “spinor” representation of the SO(N) group. 
Namely, we note that both S(a) and —S(a) correspond to the same 
rotation 


Yu st >y Quy v5 (6.99) 


and we have a double valued representation. This is due to the fact 
that the group SO(N) is not simply connected and we need to define 
the covering group which we will not go into. 

There is, however, a slight subtlety of the following nature that 
needs to be addressed. Even though we have assumed that y, defines 
an irreducible representation of the Clifford algebra, this does not 
imply that S(a) would correspond to an irreducible representation of 
the SO(N) group. For example, let N = 2M and let us consider as 
n (6.77) 


O Oy 
= : 6.100 
Vu & 0 ( ) 


In this case, we can find the block diagonal matrix S(a) (see (6.90)) 


= Si (a) 0) 
sta)=( . oe) (6.101) 


with the 2“@—! x 2™—! matrices S;(a),i = 1,2 satisfying 


oy 
Sy (a)o,Si(a 3 Ay Fv; 


N 
S3'(a)o},S2(a) = Sawa}, (6.102) 


and leading to S;(ab) = $;(a)5;(b),i = 1,2. Therefore, S1(a) and 
So(a) define 2@—-! x 2M—! dimensional irreducible representations 
of SO(N) while the block diagonal matrix S(a) corresponds to a 
reducible representation. The case of M = 2,N = 2M = 4 is of 
interest since in this case, the particular representation in (6.100) is 
related to the Weyl spinors in relativistic theories (and is known as the 
Weyl] representation). Furthermore, for some relativistic problems, it 
is of interest to consider a representation where all the generators ¥,, 
are real matrices (known as the Majorana representation in physics) 
which we will not go into. 
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Let us now discuss briefly about the connection between the 
Clifford algebra and the Lie algebra of the SO(V) group. If we define 


1 
Jw = 4 [Yu Ww] i —Jup; (6.103) 


then, it is straightforward to check that it satisfies the Lie algebra 
(3.24) of so(.NV) 


Biren Jag] = —Srodvg = by BF 0x + bupIva + Svat up: (6.104) 


If we further assume that yh, = 7 (namely, Hermitian generators), it 
follows from (6.103) that ae = —J,, corresponding to the Lie algebra 
structure of the compact group SO(N,R) = R™. However, following 
the earlier arguments, we can readily see that the J,,, constructed in 
this manner is not irreducible. In fact, considering the representation 
n (6.100), we can determine that 


(+) 
J V 0 
Juice ( e ) (6.105) 


0 Tis ) 
where 
1 
+ = + 
Je Z (0, C= ovo}.) = —JI), 
Ip =z = (how ~oto 1) = —J), (6.106) 


and which satisfy 


(7) IG = dua de) —0,pI #b,0F DP £ brad. (6.107) 


For the case of M = 2,N = 2M = 4, we note that I® correspond 
to infinitesimal generators of rotation for Weyl spinors and lead to 
2 x 2 matrix representations of SO(N). 

We can now construct more general spinor representations of 
the SO(N) group. Let us consider the case of N = 2M and write 
(the prefactor is a normalization constant, i = /—1) 


inM 
A = exp(—>—) 1172" Yam (6.108) 
which, as we have seen in (6.13), satisfies 


Ayt+twA=0, w=1,2,---,2M, (6.109) 
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and A? = 1 with the particular choice of our normalization in (6.108). 

Let (fi, fo,--:, fn) with fi > fo >--- > fn > 0 represent a 
Young tableau for the group SO(N) and Ths p5..u, denote the irre- 
ducible tensor corresponding to this diagram. (Here we are assuming 
that n < N and that fn4i, fn+2,°°: , fn = 0.) For the 2!*+! dimen- 
sional spinor of SO(N), we consider a general spinor-tensor given 
by 


Wer piaesjig OD OL pa peo-pees (6.110) 
and introduce the two spinors obtained from this as 
(H) = (1EA (6.111) 
Pp po" bn _ 9 ( fas ) Pugnean 7 


If we choose A to be (2” x 2™) block diagonal of the form (A? = 1) 


1 0 
ie é = (6.112) 


with the 7, matrices of the form in (6.77) 


O oy 
= ; 6.113 
Yu € 0 ( ) 


it is clear from the definitions in (6.111) and (6.112) that the spinors 
ot ee pn are effectively 2™ dimensional spinors which are the analogs 
of the 2 dimensional Weyl spinors of the Lorentz group. 

Although the spinors @,,,9..u, furnish representations of the 
SO(N) group in a natural manner, these are not irreducible in gen- 
eral, since it contains nontrivial subspaces such as 


N 

x (+) 
ae Or rsisees (6.114) 
A=1 


In order to make them irreducible, we need to impose conditions such 
as 


N 

dA 4 (+) = 
SO eniaie = 0) (6.115) 
A=1 


and so on. We note here that the condition (6.115) also automatically 
implies that 


N 

Xr (+) _ 
NY Pia niiens = 0s (6.116) 
A=1 
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which can be seen, for example, by considering a simple case. Let 
us consider a symmetric or ant-symmetric spinor-tensor satisfying 
of = eo), € = +1. In this case, 


N 
> ¢® =0, (6.117) 
A= 
implies 
N N 
Vevey =D Vey? = 0. (6.118) 
AL A=1 


As a more complicated example, let us consider a spinor-tensor oo 
corresponding to the Young tableau (see (5.43)) 


+ flv 
burt Fy 


which satisfies (see (5.31)) 


oO = 60) = -9 2, 6 + dQ + 08 = 0. (6.119) 


In this case, the relation 


N 
> 62 =0, (6.120) 
p=1 


leads to 


N se N - 
d 7 oe) = d. 7p =0, 


N 
> TP ion ae Ds (#3) 9 oe) = 0. (6.121) 


The representations for (odd) N = 2M +1 can be reduced to the case 
of SO(2M) since (see (6.24)) A = cy 1 (or they can be constructed 
in a similar manner). 


In ending this section, we note that if aj, ali = 1,2,---,M 
denote annihilation and creation operators for fermions satisfying 

aja; + aja; =0= ata! + alal, 

aol +ala;=dy1, 1,7 =1,2,---,M, (6.122) 
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then, the 2M Hermitian operators (i = 1,2,--- , M) 


1 Vicott é 1) =f 
w= Fy(atal) al ree = Ty (a al) = than 
(6.123) 


can be checked to satisfy the algebra 
Wi + Wp = 261, py =1,2,--- 2M. (6.124) 


The underlying (Hilbert) vector space of the fermionic theory can be 
constructed starting with the vacuum state |0) satisfying 


aj|0) = 0, eS 2 ,M, (6.125) 
and consists of the states 


|i) = a! 0), li1,42) = al al 0), |é1,é2,43) =a! al alloy, (6.126) 


a1 "22 a1 02 23, 


and so on where the states are anti-symmetric under the interchange 
of any pair of indices. In particular, the state with the maximum 
number of possible labels is given by 


|i1,72,-°° HRT) =alal,---al \0), (6.127) 


and is totally anti-symmetric in all its indices. The total dimension 
of the vector space can now be calculated easily 


1 
1+M+5,M(M-1)+---+M+1=(1+1)¥ =2™, (6.128) 


which reproduces (yet another way) the dimension of the irreducible 
representation of the Clifford algebra for the case N = 2M (see 
(6.41)). 
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CHAPTER 7 


Lorentz group and the Dirac equation 


7.1 Lorentz group 


In section 1.2.5 we had briefly introduced the concept of the Lorentz 
group. Since it plays an important role in the study of relativistic 
phenomena, we will look at this group in more detail in this chapter. 

Special relativity requires consideration of the non-compact or- 
thogonal group O(1,3) ~ O(3,1) instead of the compact group O(4) 
more appropriate for Euclidean rotations. As we know, the quadratic 
form of the Minkowski space 


et? — (2? +y? + 27), (7.1) 


where c denotes the speed of light in vacuum, is invariant under 
transformations belonging to O(3,1). It is more convenient to define 


a = (2°, x, x”, 2°), (f2) 
with 
e=acd, vi=az, e=y, w=z, (7.3) 


so that we can write the quadratic form in (7.1) also as 


(x)? — (at)? — (#7)? — (2°)? = tte”, (7.4) 


where the repeated indices y,v = 0,1, 2,3, are summed and we have 
identified 


0, if wy, 
it = aps 1, af pev=G, (7.5) 
-—1 if pw=v=1,2, or 3. 


In this compact notation we can define the Lorentz transforma- 
tions as 


ch — g'# = gh a”, (7.6) 
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for real constants a’, = (a",)*. Invariance of the quadratic form (7.4) 
under these transformations now leads to 


/ Iv V 
vt 2 =e a, 
| ae,. Vy _ aN 
Or, Nw (a 5a ) (a a) Sapper wes 
boy 
OF, MwA" ya”, = Nrp- (7.7) 


Let us next introduce the 4 x 4 matrices 7 and a as 


1 O 0 0 
0 -l1 O 0 
(n) ww = Nv or, n= 0 0 ae 0 ; 
0 O QO -1 
on Wy Osa", 
1 1 1 1 
Qy_, a, As a 
(Gass or, a= ? : 2 o ‘ (7.8) 


ay a a) 0% 
ao a4 a, a®s 


so that we can write the condition in (7.7) also as the matrix equation 
(note that (a7) yu» = (a)yu = a”,) 


LL 
a’ na — 1; (7.9) 
with the reality condition 


a =a; (7.10) 


We also note from (7.8) that 


v=n, =n, dety=-1. (7.11) 


Let us next define 7“” as the tensor with the same numerical 
values as 7,,, namely, 


0, if wy, 
les | ie 1; ti pay =U; (7.12) 
-—1 if w=v=1,2, or3. 


It is clear that it satisfies 


ny, = ne L, y= 0, 1, 2, 3, (7.13) 
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so that 7"” really represents the matrix elements of the inverse matrix 


n + although numerically we have 7! = 7, namely, (see (7.6)) 
(7), =”. (7.14) 


We can now derive from (7.9) that 


a’ =na ly eT =a 1a? ny, (7.15) 
as well as 
an a Sona 4g) Se (7.16) 


The tensors 7” and 7, define the metric tensors of the underlying 
Minkowski space and using them we can now raise and lower the 
tensor indices as 

Shwe = (2°, —a!, —2?, —2*), 

ag! = 7s, = (a°, 2,27, 2°), (7:17) 


as we would expect (see (7.2)). 

The set of all such matrices a (satisfying (7.9), (7.10) and (7.16)) 
defines the Lorentz group Ly. In fact, if a,b € Iba, then we note from 
(7.9) and (7.10) that their matrix product satisfies 


(a6) =a0 Sab; 
(ab)? n(ab) = b* (ab na)b = b' nb = n, (7.18) 


so that ab € Ly. Moreover, 14 defines the identity matrix of Ly. 
However, La is non-compact unlike the compact real rotation group 
R4 (see also section 1.2.5). To see this, we note that the 00 element 
of (7.9) (or for 4 = p = 0 in equation (7.7)) leads to 


(a%)° - (a9) =1, 
or, (a%)” =1+ (a’9)” =1+ (ain)?, i=1,2,3. (7.19) 
Similarly, taking the 00 clement of (7.16) we obtain 
(a%)° =1+ (a%)° = 1+ (a0s)?. (7.20) 


Since a°,,a’) and a®, are real numbers, (7.19) and (7.20) show that 
they can take unbounded values so that the group manifold for Ly, is 
unbounded and non-compact. We also note from (7.19) that 


(ao)? S14; 


ory 05) S44. cory, 29) <1: (7.21) 
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Using (7.11) we note from (7.7) that 
det (a" na) = det 7, 
or, det (det a)* = — (deta)? = -1, 
or, deta = +1. (7.22) 


Lorentz transformations for which ay < —1 involve time rever- 
sal. For example, consider 

>a, ai > xt, (7.23) 
belongs to this class and it is clear that for such a transformation 
deta = —1. Similarly, the transformations with a space reflection 
(parity) 

ose, eta a2, 1 =1,2,3, (7.24) 
correspond to the class of transformations with a5 > 1 and also sat- 
isfy det a = —1. Clearly these transformations (group elements) can- 
not be continuously obtained from the trivial transformation (iden- 
tity element). (Time reversal and space reflection are, in fact, dis- 
crete space-time symmetries. We note from (7.24) that 7“ + x, (see 
(7.17)) corresponds to the transformation of space reflection.) This 
leads us to define various subgroups of IL, depending on the values 
of a°) and det a in the following way. 


7.1.1 Proper orthochronous Lorentz group. If we restrict the set of 
matrices a € Ly to satisfy 


i) ly. dete= 1, (0225) 


this defines the proper orthochronous Lorentz group. Clearly, (7.25) 
does not allow either time reversal or space reflection (parity). This 
contains the identity matrix and it can be checked that this defines 
a subgroup of Ly in the following manner. If a,b € ly satisfying 
ay > 1 and DG > 1, then 


(ab)°y = ab’ = Gi gd’ + bo = and’ = a°.bio, (7.26) 


where we have used (7.17) to lower the spatial index. On the other 
hand, using the Schwarz inequality (as well as (7.19) and (7.20)) we 
have 


a; bio < |a°,||bjo| = x (a°,)?(bj0)? 


= 4 ((a%)? — 1) (0%)? - 1), (7.27) 
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so that (7.26) leads to 


(ab)% > a%yb% — / ((a%)? — 1)((6%)? — 1). (7.28) 


Let us further note that 


or, a 96% — 4/ ((a%)? = 1)((0%)2 = 1) > 1, (7.29) 
where we have used Qs ae > 1 and, consequently (7.28) leads to 


(ab)% > 1. (7.30) 


7.1.2 Orthochronous Lorentz group. If we restrict to group elements 
a € Ly satisfying only iG > 1 and allowing for both deta = +1, 
then such elements also define a group known as the orthochronous 
Lorentz group. This does not allow time reversal, but includes space 
reflection (parity). We note that for the pure space reflection defined 
in (7.24), we can identify 


c= %, (7.31) 


and it is clear that (7.9) and (7.10) are satisfied trivially in this case. 


7.1.3 Improper Lorentz group. Ifa © Ly, satisfies det a = 1, but allows 
for both are > land we < —1, then this defines the improper Lorentz 
group. Clearly, in this case both time reversal and space reflection 
(parity) are allowed together. 


7.2 Generalized Clifford algebra 


In chapter 6, we have studied the Clifford algebra in the Euclidean 


space where the metric tensor was the Kronecker delta d,,. In Minkowski 


space we have the nontrivial metric tensors given by n,, and n'” (de- 
fined in (7.5) and (7.12)). We can now generalize the Clifford algebra 
(6.1) to the Minkowski space as 


YeVv or Wwe = 2Nww 1, M,Yv= 0, 1s 2, 3, (7.32) 
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in order to study the Lorentz group. The representation theory of 
this generalized Clifford algebra, however, remains unchanged. In 
fact, if we redefine 


Y=%, VW= Ht, (7.33) 
the algebra (7.32) takes the form of (6.1) 
ere 5 Vi = 26 uw 1, (7.34) 


so that it has a unique four dimensional irreducible representation. 
The standard representation of the algebra (7.32) is known as 
the Dirac representation and has the form (7 = n”7,) 


3 ie <0 ree 
=A = ’ a aD » t= 1,4,9, 
YD YOo 0 =i YD Vid oc; 0 


(7.35) 
with the charge conjugation matrix C’ given by 
0 o2 
C437, = ( (7.36) 
02 0 


Here o;,i = 1,2,3 denote the three 2 x 2 (Hermitian) Pauli matrices 
defined in (6.55). We note from (7.35) that 


(yb) = 797578, w= 0,1,2,3. (7.37) 


Another commonly used representation (which is, of course, related 
by a similarity transformation and, therefore, equivalent) is known as 
the Weyl representation where the generators of the Clifford algebra 
take the form 


0 oO 

bb be 
w= | ~ ; 7.38 
(: 0 ( ) 


where 
on = (12, 0%), Fi = (le, —9;) (= of). (7.39) 
The o,, matrices satisfy 


~T ~ p 
02002 =F, 9201.02 =O, (7.40) 
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so that, in this representation, the charge conjugation matrix can be 
identified with 


—02 0 
ia ( a | (7.41) 


Note that the y{, matrices in the Weyl representation also satisfy 
(7.37) which is independent of (equivalent) representations. There is 
yet another useful representation known as the Majorana represen- 
tation where all the generators are purely imaginary and have the 
forms 


7 _ 0 a9 A a 103 +O 
sa o. O]}° a 0 ios)’ 
0 —-—oa2 —101 0 
2 3 
= : = ; 7.42 
Yu @ 0 Yu ( 0 ‘ ( ) 


These matrices still satisfy (7,)' = 79,779, and since all the 7%, 
matrices are purely imaginary, namely, (y\,)* = —7yh;, it follows that 


tu Veer = (he)! = —Oh)*, (7.43) 


so that we can identify the charge conjugation matrix with (the nega- 
tive sign is a phase convention and the reason behind this will become 
clear at the end of the next section) 


0 -«a 
Cu = 7h = (°. o : (7.44) 


7.3 Dirac equation 


Let us next consider the motion of a charged fermion with mass m and 
charge e interacting with an external electromagnetic field described 


by the Dirac equation (repeated indices are summed) 
me 


(1! (21 — 554u) — 5) ¥@) = 0, (7.45) 


where x“ denotes the space-time coordinates defined in (7.2) and 
v(x) is a four component spinor 


v1 (2) 
Wo(x) 
W3(x) |’ 
pa(x) 


x 


p(x) = (7.46) 


4 
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on which the 4 x 4 matrices, 7, operate (note that y“ = n#”y,). We 
also note that c and h denote respectively the speed of light in vacuum 
and the Planck’s constant while A,, represents the vector potential for 
the electromagnetic field which is assumed to be Hermitian, namely, 
A(x) = A,(x). Furthermore, to simplify our discussions we will 
choose units in which h = c = 1 so that (7.45) takes the form (0, = 


(a) 
Ox ) 
(1 (i, — €A,(w)) - m) W(x) = 0. (7.47) 


The Dirac equation is a relativistic equation and we will next 
show that it transforms covariantly under a Lorentz transformation 
(7.6) 


ch eh = ah’, aely. (7.48) 
To this end let us define 
FM = aly”, (7.49) 
which leads to 
Aue 4 Fae = ala’, (ry sie ry) 
= ala", (20 1) =) (an~*a*) 1 
=2 Gr) s 1 = 2n” 1, (7.50) 


where we have made use of (7.16). From the uniqueness of the irre- 
ducible representation of the Clifford algebra, we conclude that there 
exists a 4 x 4 matrix similarity transformation S(a) such that (see 
also (6.99) for the Euclidean case) 


iM = S*(a)y"S(a), (7.51) 


such that we can write (see (7.49) and (7.51), summation over re- 
peated indices is understood) 


S-1(a)y"S(a) = a",7", (7.52) 


for any Lorentz transformation a € Ly. 
We next note (using the chain rule) that the cogradient vector 
can be written as 
6) Or” O ' 


On = 59 = Gop gw = Mu (7.53) 
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where we have used (7.6). This can be inverted (using (7.15)) to write 
O,, = 4, Oy. (7.54) 


Under a Lorentz transformation the electromagnetic potential A,,(«) 
transforms like the vector x, or the vector 0, so that we can write 
(see (7.53) and (7.54)) 
Aj (a) = A, (a) =a, A,(x), 
Or; Ag) =a, Ala). (7.55) 


Therefore, using (7.52)-(7.55) we can write 


x (40u — eAy(z)) = y"a", (ia, — cA, (2’)) 
= §-la) (7 (ia, 2 eA, (2")) J S(a). (7.56) 


S 
or, 7” (ia, me eA, (2")) = m| w'(a") = 0, (7.57) 
where we have identified 
p'(2') = S(a)p(2). (7.58) 


We note that (7.57) has exactly the same form as the original Dirac 
equation (7.47). 

Let us next consider two Lorentz transformations, a,b € 4 and 
note that (see (7.52)) 


S~*(b) (S~*(a)y"'8(a)) S(b) = S~*(b) (a4 7”) S(b) 
= al, S*(b)7"S(0) = al by? 
= (ab)",7* = S~!(ab)y"S(ab), (7.59) 
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so that 
[v“, S(ab)S~1(b)S-*(a)] =0, (7.60) 
which lets us identify (by Schur’s lemma) 
S(ab)S~*(b)S~'(a) = Al, 
or, S(ab) = AS(a)S(b), (7.61) 


with A a constant. If we impose the condition det S(a) = 1 as in the 
previous section and note that S(a) is a 4 x 4 matrix (in the Dirac 
spinor space), (7.61) determines 


Me=1, or, A=H+1,4i. (7.62) 


The continuity of the group elements (as we have argued earlier) 
selects 


N=1, (7.63) 


if we are restricting to proper orthochronous Lorentz transformations 
and in this case the similarity transformation (matrix) defines a 4- 
dimensional representation of the proper orthochronous group. (In 
general, however, S(a) can lead to a ray representation of the Lorentz 
group.) With this we can identify ~/(x’) in (7.58) as the Lorentz 
transformed Dirac spinor and (7.57) as showing the covariance of the 
Dirac equation under a Lorentz transformation. 

Taking the Hermitian conjugate of (7.52) and recalling that a’, 
are real for Lorentz transformations, we obtain 


(S-*(a)y#'S(a))" = at, (9°), 
or, 9° (St(a)(y*)t(S~Yt(a)) 9° = at 9Cy")h9? = at", 
or, (9°st(a)7°) 4 (99st(a)q°) "= S(a)"S(a), 
or, [S(a)7°S* (a)9°, "| =0, (7.64) 


where we have used (7.37) as well as the properties y° = (y°)i = 
(7°)! in the Dirac representation. Therefore, we can write (by 
Schur’s lemma) 


S(a)y°S"(a)7° = b1, (7.65) 


7.3 DIRAC EQUATION 191 


with 6 a constant. Taking the Hermitian conjugate of this relation 
(and using 7° = (7°)', (7°)? = 1) we obtain 


°S(a)y°S' (a) = b*1 
or, S(a)y°S'(a)7° = b*1. (7.66) 
Comparing (7.65) and (7.66) we determine 
b = b* = real. (7.67) 


Taking the determinant of both sides in (7.65) and using the fact that 
det S(a) = +1, we conclude that 


b¢}=1, or, b= +1,+i, (7.68) 


and the two real roots of this equation are given by 6 = +1. Once 
again, the continuity of the group elements for the proper orthochronous 
Lorentz group with det S(a) = 1 determines b = 1. In fact, in this 
case, the argument can be extended to the entire Lorentz group in the 
following manner. Let us consider the reflection (parity) operation 
(see (7.24)) (2°, 2") > (2°, —x’) corresponding to the choice a = 
(see (7.31)). A simple analysis of the Dirac equation shows that it is 
invariant under (up to a phase) 


(a, e) = w (a, —z') = om p(x, ey, (7.69) 


which implies that we can identify $(7) = y°. On the other hand, 
this directly leads to (the phase simply drops out) 


S(n)7°St(n)y° = ()* = 1, (7.70) 


and comparing with (7.65), we conclude that b = 1. 

Although the finite dimensional matrices S(a) provide a rep- 
resentation of the Lorentz group, the representation is not unitary 
(ST(a)$(a) # 1) in conformity with the fact that the Lorentz group 
is non-compact (see, for example, the discussion in section 1.2.5). 
However, we will show next that w(x) provides an infinite dimen- 
sional unitary representation of the Lorentz group. To see this, let 
us take the Hermitian conjugate of (7.47) to obtain 


UN (e) (Co)! (i, - ey (w)) — m) =0, (7.71) 
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where we have used the fact that Al(x) = A,(x). Using (7y“)t = 
7yHy (see (7.37)) as well as (y°)? = 1, we can write (7.71) also as 


wt (x)y Ga (- i, — eA, ( (x)) —m) 7° =0, 


or, Ble) (14 (i. + eAy(e)) +m) =0, (7.72) 
where we have identified (7 is known as the adjoint spinor) 
B(x) = vl (a). (7.73) 


As aresult, from (7.47) and (7.72) we can deduce the conservation law 
(continuity equation) for the electric current in the standard manner 


a, (D(x)y"(2)) = 0. (7.74) 

Under a general Lorentz transformation ~(x) transforms as 

D(x) + (2) = (W(a')) "9° = (S(@v(a))'7° = ota) st(a)y? 
= ol (a)q°V°S"(a)7° = B(a)S“*(a), (7.75) 


where we have used (7.65) (with b = 1). This shows that the adjoint 
spinor w(x) transforms inversely as (a) under a Lorentz transfor- 
mation (which is why ~ is so important in studying relativistic Dirac 
systems). It follows now that 


‘Yw' (a) = P(x) S~*(a)S(a)b(x) = B(x)4(x), (scalar), 
x! )yMap'(a!) = h(x) S~'9"S(a)Y(a) 
= a h(x)" p( 
D (2), VW 2’) = B@)S (ay, Y1S(@)4(2) 


= ata’ W(x)[y* »y°|u(x2), (tensor). (7.76) 


Similarly, defining (see, for example, (6.12) and the factor of 7 is 
included in the definition to make the matrix Hermitian) 


B(x 
ob ( 


x), (vector), 


i 
Lapeay yy = einer y (7.77) 


where €,,,\p is the completely anti-symmetric Levi-Civita tensor with 
€0123 = —1, we can show that 


D (a')y5u" (x') = (deta) y(x)y5)(x), — (pseudo-scalar), 


B (a! )ysytl! (a’) = (det a) a4,D(x)957"'U(e), (axial vector). 
(7.78) 
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Let us consider the inner product for the Dirac field (spinor) 
defined as 


(lp) = / dx bt (x, x)b(2°,x), (7.79) 


which is clearly positive definite. We also note that (uw = 0,1,2,3 and 
i=1,2,3) 


5a f Pe vl(exu(a?.2 = f aoa les 
stn (2°, x)y°ab(a®, x) ) = f xo He ob(z)) 
=) ez ax tab(@)) — 8: (B(a)7'#(a))] 
e i xd; (D(a)y'b(2)) = 0, (7.80) 


where we have used (7.74) in the intermediate step and the last step 
follows if the fields fall off asymptotically, namely, w(x°,x) — 0 for 
|x| + oo. Thus, we see that for such fields, (|) is independent of 
a 

In order to prove that the inner product (~|w) is also Lorentz 
invariant, we use the following argument. Let o denote an arbitrary 
space-like surface, namely, any two distinct points 7”, y" on the sur- 
face o satisfy 


hie =o ie Sy = (2° es y°)? — (x? — y’)? <0. (7.81) 


Such a surface is manifestly Lorentz invariant. The most familiar of 
such a surface is, of course, a time slice defined by x° = constant 
(equal-time surface). The infinitesimal volume vector at any point 
on such a surface is given by 


doy(a) = (da'da*da®, da°dx*da*, dx°dz'dz*, dx°dzdz?) 


_dalde*da*da*) 
7 dat! 


(7.82) 


and this transforms like the vector x,, under a Lorentz transformation. 
Using this we can write the inner product as 


We = f doy (2) Be)", (7.83) 


oO 
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0 


which reduces to (7.79) for the surface x” = constant. Under a 


Lorentz transformation we have 


WIDe + Wer = / do!) Fa! yb! (a) 


= (U|P)o; (7.84) 


so that the inner product is invariant under a Lorentz transformation. 
Here we have used the fact that do, and wy"w transform inversely 
under a Lorentz transformation (one is like x, while the other is like 
x"). We have also used in the intermediate step our earlier observa- 
tion that the surface o is Lorentz invariant. 

The inner product (7)|~)> is actually independent of the surface 
go on which it is defined and, therefore, we can identify this with 
the definition in (7.79). To demonstarte this, let us consider the 
differential 


dF (a) 


i aay es (7.85) 
where 
ae : doy (x) d(a)y*ov(a), (7.86) 


and we assume that the space-like surfaces o and o’ differ only in an 
infinitesimal volume AV around the point y on o (see Fig. 7.1). In 


Figure 7.1: The surfaces o and o’ differ only around a point y in an 
infinitesimal volume AV. 


this case, using Gauss’ theorem we obtain 


5a(y) ad doy(x) pa)y"o(a) = a (d(y)y"'(y)) =0, — (7.87) 
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which follows from (7.74). Although we derived this result by consid- 
ering two space-like surfaces o and o’ differing only in an infinitesi- 
mal region, this result can be extended to other cases and proves that 
(W|w)> is independent of the surface o. As a result, we conclude that 
the inner product defined in (7.79) is invariant under any Lorentz 
transformation 


(YY) = ly), (7.88) 


so that the transformation 
p(x) > ¥'(2') = S(a)v(z), (7.89) 


is unitary with respect to this inner product. We would like to note 
here that the Hilbert space under consideration here is that of the 
first quantized Dirac field. A more physically interesting case is that 
for the second quantized Dirac field which we will not go into. 


7.3.1 Charge conjugation. Let us now return to equation (7.72) sat- 
isfied by the adjoint spinor and take its matrix transpose to obtain 


((7)? (10, + eAy(a)) + m) B' (x) =0. (7.90) 


Furthermore, recalling that (see (6.58)) the charge conjugation matrix 
relates 


(QE = Se =the: (7.91) 


we can rewrite (7.90) also as 


C7} (7! (iO, + eA, (a)) — m) CH" (x) =0, 
or, (7 (iO, + eAy(x)) — m) vo(x) = 0, (7.92) 
where we have identified 
Wo(x) = CB (a). (7.93) 


Therefore, we see that the spinor ~¢(2) satisfies the same equation as 
w(x) with the same mass m and opposite electric charge (—e instead 
of e). Consequently, we can interpret ~o(x) to represent the anti- 
particle state of the Dirac particle represented by ~(x) (and is known 
as the charge conjugate spinor which is the reason for the subscript). 
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Under a general Lorentz transformation, this spinor transforms 


as (see (7.75)) 
W(a") = CP (2) = C (P(x)S-'(a))* = C (B(x)S(a-))* 


= CST (a7!) (2) = CST (a™)071C¥' (2), 
or, wWU(2') = (CS7(a*)o™") Wo(z). (7.94) 


In particular, for space reflection (parity), we have already seen in 
(7.69) that 


Wey 2) Sy a), 

S(a)=7°, S(a4)=S (a) =7. (7.95) 
For such a transformation 

CS? (a-)071 = C(y)Fc71 = -7°, (7.96) 


so that we see from (7.94) that under a parity transformation, the 
charge conjugate spinor will transform as 


Vo(x®, —z') = = We a a) (7.97) 


This should be contrasted with the transformation of y(a) in (7.95) 
and implies that the parity quantum number of the anti-particle is 
opposite of that for the particle. 

On the other hand, for a proper orthochronous Lorentz trans- 
formations, we can show that 


C(S(a-!))? C7 = S(a), (7.98) 
in the following way. Let us recall from (7.52) that 

S*(a)y"S(a) = ah”. (7.99) 
Taking the transpose of this equation, we obtain 

(S(a))* (V4)7(S (a)? = 04,7"), 

or, ee *y#C)(S—*(a))? = at, (—C7"C), 

or, (S(a))7Cy#C(S (a)? = CT (a4 )C, 

or, ee *#C(S*(a))? = CTS (a)y"S(a)C, 

( 


or, y#C(S~*(a))7 CTS" (a) = C(S"*(a))? CNS *(a)y", 
or, [C(S~1(a))?C-1571(a), y"] = 0, (7.100) 
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where we have used (7.91) in the second line of (7.100). Therefore, 
by Schur’s lemma, we can write 


C(S—!(a))?C-1S—la) = Al. (7.101) 


Taking the determinant of both sides and recalling that det S(a) = 1 
for a proper orthochronous Lorentz transformation, as in (7.62), we 
determine that \* = 1 which leads to \ = +1,+i. The argument of 
continuity then selects \ = 1 and with this (7.101) leads to 


C(S-1(a))TO-1S-H(a) = O(S(a")7CO“'S“"(a) = 1, 
or, C(S(a—*))?C7! = S(a), (7.102) 


which proves (7.98). As aresult, we conclude from (7.94) that Wi, (2) = 
S(a)vc(x) and Wc(x) behaves much like the particle w(x) under a 
proper orthochronous Lorentz transformation. 


7.3.2 Weyl and Majorana particles. In ending this section, let us dis- 
cuss briefly the concept of Weyl particles as well as Majorana parti- 
cles. Let us first consider the Weyl representation of the y“ matrices 
(see (7.38)) given by 


sis & > . (7.103) 


oH 0 


and we write the four component spinor in terms of two component 
spinors as 


v(e) = i (7.104) 


where the two component spinors can be written as 


— [vi(2) are Y3(z) 
ea fone ea bane oy 


In terms of the two component spinors in (7.104), the Dirac equation 
(7.47) decomposes into two coupled equations 


o" (iO, — eAy(x)) (2) = me(z), 
a" (10, — eA, (x)) d(x) = mE(x). (7.106) 
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We note that under a parity transformation w(x°, 2’) > 7°u(x°, z') 
(see (7.69)) which leads to the fact that under a parity transformation 
(mirror reflection) 


o(a®, a") < E(x°, 2"). (7.107) 


This is consistent with the fact that under a Lorentz transformation 
the spinor components ¢() and €(x) transform among themselves. 

If the particle is massless (m = 0), then from (7.106) we see that 
the system of equations decouples 


o" (i0, — eA, (x)) E(x) = 0, 
&" (id, — eA,,(x)) (x) = 0, (7.108) 


and we can set either ¢(x) = 0 or (x) = 0 to have a consistent 
(two component) equation known as the Weyl equation (this is pos- 
sible only if m = 0). The two component spinors (x) and (x) 
are known as Weyl spinors and each of the dynamical equations in 
(7.108) is invariant under a proper orthochronous Lorentz transfor- 
mation. However, it follows from (7.107) that they are not invariant 
under a parity transformation. 

Let us next consider the Majorana representation of the 7“ ma- 
trices given in (7.42). In this case, the y“ matrices are purely imagi- 
nary, (y°) = —7° and we can identify the charge conjugation matrix 
with C = —7° (see (7.44)). As a result, the charge conjugate spinor 
can be written as 


Yo(a) = OF" (2) = (-9°) (veh) 


= y*(2), (7.109) 


where we have used (y°)" = —7y° in the Majorana representation. 


Namely, the charge conjugate spinor (describing the anti-particle) 
can be identified with the complex conjugate of the original spinor 
itself. (This is the reason for the negative sign in the definition of 
the charge conjugation matrix in (7.44).) From (7.92) we see that 
in the absence of the electromagnetic coupling (e = 0), the equation 
satisfied by the charge conjugate spinor is given by (see (7.92) as well 
as (7.109)) 


(iy"O, — m) w* (x) = 0, (7.110) 


7.4 REFERENCES 199 


which has the same form as the original equation (7.47) (for e = 0). 
Therefore, we can choose a real spinor ¢)*(x) = W(x) or Wc(x) = Y(z) 
so that in this case the particle is identical to its anti-particle (self 
charge conjugate). Such a particle is known as a Majorana particle 
and in this case, the phase of the parity transformation needs to be 
chosen as 


W(a®, x) > y' (2°, —2") = iy’ y (2°, 2°), (7.111) 


so that the reality condition for the spinor is maintained under the 
transformation (recall that 7° is purely imaginary in this representa- 
tion), namely, 


(w'(a°, —2*))* = Y'(a®, -2’). (7.112) 


We conclude simply by noting that there is no evidence so far for the 
existence of a Majorana fermion except possibly for a neutrino. 


7.4 References 


A. Das, Lectures on Quantum Field Theory, World Scientific, Sin- 
gapore (2008). 
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CHAPTER 8 


Yang-Mills gauge theory 


In physics gauge theories play an important role since the basic fun- 
damental forces are described by such theories. We are familiar with 
the Maxwell gauge theory from our study of electrodynamics. This 
is the simplest of gauge theories based on the symmetry group U(1). 
However, there are other gauge theories based on larger symmetry 
groups that, for example, are necessary to describe such fundamental 
forces as the weak force or the strong force. In this chapter we will 
discuss how such dynamical theories are constructed based on larger 
symmetry groups. 


8.1 Gauge field dynamics 


Let tg,a = 1,2,--- ,N define a basis of a Lie algebra L with com- 
mutation relations (here we are trying to use the notation used in 
discussions in physics as opposed to the notation in section 3.3) 


[ta, to] = ify tes a,b,c = 1,2,->- iN; (8.1) 
where 
foo = — fea» (8.2) 


denote the (real) structure constants of the group and repeated in- 
dices are assumed to be summed (in chapter 3, the structure con- 
stants were called C},, without the factor of “i”, see (3.93)). The 


generators t, are assumed to be Hermitian, tl = tq. Let Al (xz), @ = 
1,2,---,N represent the gauge field which has a Lorentz vector in- 
dex p = 0,1, 2,3 as well as a group index a and is a real function of 
the space-time coordinates 2 = (#°,a!,a?,x) in the 4-dimensional 
Minkowski space. We next introduce a L-valued gauge field A,,(x) 


(also known as the connection) by (the group index is being summed) 


Ay (x) = Aj (2)ta, (8.3) 
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as well as the field strength tensor (or the curvature) tensor F),,(x) 
associated with the gauge field by (we are setting the coupling con- 
stant for gauge interaction to unity as well as h = c = 1) 


Fuv(@) = —Fyy(x) = OpAr(x) — 0, Ay(x) + [Ap (x), Av()| 
= Fi, (x)ta; (8.4) 

where the components of the field strength tensor are given by 
Fa, (a) = 0,9 (x)—8,A4 (2) — ff A}, (2) AS (2) = — FS) (x), (8.5) 


and the anti-symmetry is manifest because of (8.2). Here we have 
used the result that (see (8.1)) 


iA, (2), Av(@)] = i [AP (eto, AS (@)te| 
= iA} (2) A5(2)[t, te] = tAN(@)AS (iffsta) 
=~ (fiAj(@)A5(2)) to (8.6) 


This follows because A? (2), AS (x) are classical functions of the space- 
time coordinates and we have used (8.1). (Note that if t*s correspond 
to a matrix representation of the symmetry group, A,, as well as Fy 
will also be matrices in that particular representation. ) 

Let us next define a space-time coordinate dependent group el- 
ement 


g(a) =e Fe, (8.7) 


of the corresponding Lie group G where €*(x) are N real functions 
of the space-time coordinates x. To get a feeling for g(x), we note 
that it cannot clearly be an element of the Lie algebra since the Lie 
algebra does not contain quadratic and higher order terms in the 
generators (basis elements). On the other hand, if we are considering 
an Xn matrix representation p(t,) of the Lie algebra, the generators 
will be n x n matrices. The exponential in such a representation, 
exp(—i€"(x)p(ta)), is also ann unitary matrix which is quite simple 
to manipulate with. We will use such a matrix representation for ta 
in this chapter (without explicitly writing p(t,)). 

Let us now consider a local gauge transformation of the form 
(it is called local because the matrix of transformation is space-time 
coordinate dependent, space-time independent transformations are 
called global) 


A(x) + Ay,(2) = g(x) Ay (2)g7 (x) + (O,g(2))9-*(x). (8-8) 
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We note here that (Ai, (a))t = A’,(x) is Hermitian if we recall that 
(see also (8.13)) 


Ou(g(a))! = Ou(g(a))~* = —9-*(x)(O,9(@)) 97 * (2). (8.9) 
We will verify shortly that A/,(x) can also be written as 
Ai, (2) = (AV (@)ta, (8.10) 


namely, Ai, (2) also belongs to the Lie algebra, but before doing that 
let us show that the field strength tensor F,,,(x) transforms covari- 
antly under the gauge transformation (8.8), namely, 


Fy (@) + Fry (@) = OA, (a) — OA, (x) + iA), (2), AL (@)] 
= o() F(x) 97" (2). (8.11) 
We note from (8.8) that 
OpAy (x) = Oy (g(x) Av(x)g™ (x) + i(Ag(x))g-*(2)) 
= g(x)(O,Ar(x))g~ (x) + (Oug(x)) Av(x)g-*(2) 
+ g(x)Av(x)(Oug~*(2)) + 1(OpA,9(x)) 9g *(2) 


+ i(A,9(2)) (Ong *(2)). (8.12) 
We recall that g(x)g~!(x) = 1 = g~!(x)g(x), which leads to 
(Aug *(@)) = —9"* (w)(Bug(@)) g(a). (8.13) 


Using this result in (8.12) we obtain 
OAL (x) = g(x)(OvAv(x))9-* (x) + (Gug(@))9-*(w) g(a) Arg *(@) 
— g(x) Ar(x)g” (a) (Oug(@))g- * (x) + i(O,A.9(x))g" (a) 
— i(d,9(x)) 9 * (x) (A,9(a)) 9 (@) 
= g(x) (QvAv(x))9- *(x) + [(Oug(@)) 9" *(x), g(@) AL(x) 9" *(2)] 
+ i(0,0rg(x))g- (x) — i(A,9(x))(Ong(2)) 9° *(2). (8.14) 
As a result, we can write 
O,AL (x) — OA), (#) = g(x) (O,Ar(x) — WAy(2)) 9 * (x) 
+ [(Qug(2))9-*(x), g(a) AL(x)g"*(2)] 
+ [9(x)Ay(x)g* (2), (@,9(a)) 9 (#)] 
+ i[(O.9(2))9-*(2), (@9(2))9-*(2)]. (8.15) 
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Similarly, we obtain in a straightforward manner 


i[A,, (2), AL (2)] = ig(@)[Ap, Av(a)]9-*(2) 
— [(A.9(@)) 9 (2), g(a) Ar (2) 9*(2)] 
— [9(x) A, (2)9-*(2), (@9(2))9-*(2)] 
— i[(8.9(@))g-*(a), (vg(@))9-*(@)]- (8.16) 

It follows now from (8.15) and (8.16) that 
F(t) = 0,A,(2) — O,A,,(a) + i[A, (2), AL(2)] 
= g(x) (8,A,() — OAy(2) + [Ap(a), Av(2)]) 9 *(2) 
= g(x) Fy(x)g7*(2). (8.17) 


showing that the field strength tensor does indeed transform covari- 
antly under the gauge transformation (8.8). 

Let us next come back to the question of whether A’, is a Lie 
algebra valued connection (gauge field) as stated in (8.10). We note 
that if X and Y denote any two linear operators (matrices), then we 
can define a one parameter dependent family of matrices as 


fAyse*Vve™, (8.18) 


where is a constant parameter. Taking the derivative of this func- 
tion with respect to the parameter A we obtain 


fC = of) se" (XYRYR)e* 
= (xX, Y]eO* = &* ((ad X)Y) e*, 
fd) = df) aynk ((ad X)°Y) eX, (8.19) 


dX 
and so on where we have used the definition of the adjoint operation 
(see (3.103)) 

(ad X)Y = [X,Y]. (8.20) 
Using these we can Taylor expand the function around A = 0 to 
obtain 

2S d\n CO yn 
nt 


12>) OO=>- 


n=0 n=0 


Gd X\°Y =XMyY, (8.21) 
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Now setting \ = 1 this leads to 


oe) 


Xe PaaS Ss" ss ¥ 
— nl 
=Y+IKY]+ SIG YI + SIG YI +. (8.22) 


Similarly, if we define another one parameter family of matrices 
as 


FO) = (a0*) eX (8.23) 


where 4 is a constant parameter and the matrix X depends on space- 
time coordinates, we obtain 
dF (A 
at ) = (a,(e*X)) eX + (a,e%*) (-xe**) 
(oe) nr 
Se Oe = ~ (ad X)"(IpX); (8.24) 

n=0 

where we have used (8.21) (with the identification Y = (0,X)). In- 

tegrating this with respect to A (between (0, )), we obtain 


iad \rti 
FA)=\¢ GE (ad X)"(0,,X). (8.25) 


n=0 


We note that F'(0) = 0 and setting A = 1 in (8.25) we obtain 


F(1) = (0,e") e* 


[oe) 1 a 
= » mp ay (XX) 


_< L s(ed xy 
~ 2. aay ad X (,%) 


ead xX 1 
= | —____ X). 2 
(-) x) (8.26) 
Let us now identify 


X = -i€%(a)t, € L, (8.27) 
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so that we can write (see (8.7)) 
g(x) = e8"(@)ta — eX, (8.28) 
Then it follows from (8.22) that 
9(2) Ay(x)g7*() = g(#)tag”*(x) A(x) = (e* tae) Ali(2) 
= (a2) AS (cr) 


2X LX, tl] ts +) A“(c). (8.29) 


= (1. + [X,ta] + T 


We note that we can write 


[X, ta] = —é [e°(x)to, ta] = —i€'(2) lt, tal 


= —i€°(a) (iffate) = —t€,°(a)te € L, (8.30) 
where we have identified (see (3.99) and (3.108)) 
E,°(a) = (2) (iff) = —€°(2) (adty),.°. (8.31) 
Note that the N x N matrix €(a) defined by 
ba (x) = (&(2) ac (8.32) 


is an element of the adjoint representation of L. Therefore we can 
write, in this notation, 


[X, ta] = (€(2) Jacte = —i(€(2)t)a. (8.33) 
Using this notation, we can now calculate 
[X, [X, ta]] = [X, -#(€(x)t)a] = (-4)?(E(@) (E(a)t))a 
= ((-€(2))*t)a, (8.34) 


where we have used (8.33) in the intermediate step. Higher order 
commutators can also be similarly calculated and substituting these 
into (8.29) we obtain 


ale)Ayg (2) = (ta + (6a) ta + 5 ((isle))Ft)0 4) AG) 
= (Bue + ieee + 5 (-ig@))P)E +) tego 


= (He) t-A% (2), (8.35) 
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which is indeed an element of the Lie algebra L. 
Similarly we can calculate (see (8.27)) 


OX = (—10,8"(2) ta, 
[X, (O.X)] = [X, ta] (—IOpE"(a)) = (08 (x) t)a(— tO" (x)), (8.36) 


and so on. Here we have used (8.33) in the last step. Using these in 
(8.26) we obtain 


(A,.g(x))g~* (@) = (O.X) + ibe (QnX)] + SLX [X (O.X)]] + 
= (to + (IC )Dn + F((I6(e))*Na ++) (i080) 
_ fe#@ -1\ ° 
= ( Hela) : te(—t0,6"(x)) 
=A t,(-10)6%(@)) EL, (8.37) 
where we have denoted 
Of (58 (ry)™ ee 18(®) — 
A(e) = S75 SG) = (8.38) 


(n+)! ~i€(z) 
We conclude from (8.35) and (8.37) that 
Ay (2) + Ay,(2) = 9(2)A,(2)g-"(2) + i(8,g(@))g- (a) 
= ((e#) © Ag(a) + (AC) (OnE"(@))) te» (8-39) 


which belongs to the Lie algebra LE and we can write the transforma- 
tion laws for the component fields as 


Ag (a) + Aja(a) = ((e#) "AS + (A(@)) (QuE"(2))). 
(8.40) 
From (8.17) we now see using (8.35) that the components of the field 


strength tensor (curvature) transform under a gauge transformation 
covariantly as 


Fis, (0) + Fi(2) = (e#) * Fs (2), (8.41) 
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without the inhomogeneous term proportional to A(x) present in the 
transformation of the gauge field (potential) in (8.40). Since the 
N x N matrix U(g) given by 


(U(9))as = (e8) *, (8.42) 


a 


gives the adjoint representation of the group G (see (8.32)), we con- 
clude that the field strength tensor Fii,(x),a = 1,2,---,N belongs 
to the adjoint representation of the group G. This is independent of 
any particular underlying n x n matrix representation Aq = p(ta) of 
the Lie algebra L used. 


> Example (Abelian theory). As an example, we note that the special case of 
N = 1 is of some interest. It corresponds to an Abelian group since the only 
generator of the group commutes with itself, namely, [t1,ti1] = 0 and we can 
identify the matrix A = p(t:) = 1. As a result, the group element (8.7) 


g(a) =e), (8.43) 


also becomes a commuting ordinary function. As a result, the gauge field (poten- 
tial) B,,(x) = A},(x) transforms under a gauge transformation as 


B(x) > Bi (x) = g(x) Byu(x)g~" (x) + (O92) 97" (2) 
= By (x) + (Qp€"(2)), (8.44) 


while the field strength (curvature) tensor (the commutator in (8.5) vanishes for 
the Abelian case) 


Gur (a)(= Fav («)) = 0,.BL(2) — 0,B, (2), (8.45) 
remains invariant under the transformation 
Gu (@) 4 Guv(«) = g(x)Gur(w)g7* (2) = Gur (2). (8.46) 


These reproduce the well known classical gauge transformations of the electro- 
magnetic potential (for example, with the identification €1(x) = a(x)). 
< 


Let us next return to the general case and note that since the 
field strength tensor transforms covariantly under a gauge transfor- 
mation (see (8.11)) 


Pew a) Piel 2)) (8.47) 
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where we have used the property of cyclicity under a trace. Therefore, 
we see that this quadratic combination of the fielod strength tensor 
is not only Lorentz invariant (all the Lorentz indices u,v have been 
contracted), but is also invariant under a gauge transformation (8.8). 
Furthermore, since Fj, = Fii,ta (see (8.4)), if we denote (see (3.111)) 


Jab = Tr (tat) , (8.48) 


we can write the invariant quadratic combination in (8.47) in terms 
of components as (recall that repeated indices are summed) 


Tr (FM (2) Fu, (x)) = Tr (tate) FY” *(2) Fi, (2) 
= gag Pe (a). (8.49) 


In general, we can choose the matrix representation p(tqa) such that 
it satisfies gap = $ Sab in the fundamental representation (to which 
the fermions generally belong) for physics problems as we will see 
shortly. As a result, with a proper normalization the Lagrangian 
density describing gauge field dynamics can be written as 


1 1 
Le =—5 Ry FM = —2 Fi Pe. (8.50) 


8.2 Fermion dynamics 


Let us now consider the Dirac field y;(a),i = 1,2,--- ,n in four space- 
time dimensions belonging to the n-dimensional representation of the 
group G, where for each index i, the field 7;(x) represents a four 
component Dirac spinor, namely, 


i1(2) 
$i,2(@) 
w;(2) = | °° & Persp, (8.51) 
” $1,3(2) 
$i,4(2) 
Let us assume that the Dirac field satisfies the equation (see (7.47), 
we have set h = c = 1 as well as the coupling constant to unity) 


(9 (Op — Au(@)) — m) o(a) = 0, (8.52) 
where we have identified (a) with the column matrix 
yr (a) 
2(a) 


v(x) = . F (8.53) 
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on which the generators t, (we identify t, with its n x n matrix 
representation) in A,,(x) = Aj,(x)ta act, namely, 


(ta(®)); = (ta)ighj(@), (8.54) 
for 7,7 = 1,2,--- ,n and a = 1,2,--- ,N. (Of course, every compo- 
nent of the fermion field corresponds to a four component spinor as 
in (8.51).) 

If we define the gauge transformation for the Dirac field to be 

W(x) > v(x) = g(x)Y(2), (8.55) 


where g(x) is defined in (8.7) for any representation, then, it is easy 
to see that the Dirac equation (8.52) is invariant under the combined 
transformations of (8.8) and (8.55). In fact, let us note that 


(9! (2, — Ay (@)) —m) W!(a) 


= (o" (Op — (9(2) Ang (x) + i(8,.9(x))9- *(2))) 


— m) 9(x)¥(e) 
= (1"((id,9()) + 9(@)id,. — 9) A, (2) — i(8,.9(@))) 
— mg(x)) v() 
= 9(2)(1" (By — Ay(2)) — m)¥(@) = 0. (8.56) 


For the transformation of the adjoint equation (7.72), we note 
from (8.55) that the adjoint spinor 7)(x) = #'(x)7° transforms under 
a gauge transformation in the form 


B(x) > B(x) = B(x) gl (2). (8.57) 


In order to make the bilinear product such as ¢(2)w(x) invariant 
under a gauge transformation, it is clear that we must assume that 
the n x n matrix g(x) is unitary, namely, 


gd @)g@)= Thy oy =e- (8.58) 
Furthermore, in analogy to the electromagnetic case of N = 1 (see 


discussion around (8.44)) where Bi(a) = (Al)'(x) =A (ec) = Ba), 
we assume that 


(At) \ (a) = Aa (a), GH T2468 nN, 
Al (x) = Ay (2), Ai) = Fa (@): (8.59) 
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(which is consistent with our choice of Hermitian generators t,) so 
that the adjoint equation has the form as in (7.72) 


D(x) (> (15, = A,(z)) i m) —0, (8.60) 


and which can also be seen as discussed above to be covariant under 
a gauge transformation (using (8.58)). The continuity equation (con- 
servation law) that follows from (8.52) and (8.60) continues to have 
the form (7.74) 


Ay (b(x)y"(x)) = 0. (8.61) 


The Lagrangian density which leads to the dynamical equations (8.52) 
and (8.60) has the form 


Ly = 0 (y* (Oy — Ay) — m) db. (8.62) 


The total Lagrangian density describing the interaction of fermions 
with Yang-Mills (gauge) fields is given by (see (8.50) and (8.62)) 


L=Lo+Lly 
= -5 Tr (F#” (x) Fyp(2)) 


+ h(x) (9 (Op — Au(x)) — m) ¥(a), (8.63) 


where we assume that the generators in the gauge field Lagrangian 
density are in the fundamental representation as discussed earlier. 
The Lagrangian density in (8.63) is invariant under the combined 
local gauge transformations (see (8.8), (8.55), (8.57) as well as (8.58)) 


v(x) > W'(e) = g(x)v(2), 
D(a) 3 U(x) = B(x)g*(2), 
A, (a) + Al(a) = g(x) Ay (x)g71(a) + i(O,9(@))g7 (a). (8.64) 


The invariance of the first term in (8.63) has already been shown in 
(8.47) and, therefore, we only need to check the invariance of the 
Lagrangian density involving the Dirac fields. If we use the covariant 
transformation derived in (8.56) as well as the transformation of (2) 
from (8.64), we obtain 


/ 


w (x) (y" (10, a Ai,(x)) - m) w(x) 
x)g*(«)g(x) (9 (iO, — Ay(x)) — m) H(a) 
x) (y"* (tOn — Ay(z)) — m) p(2), (8.65) 


212 8 YANG-MILLS GAUGE THEORY 


which proves the invariance of the fermion part of the Lagrangian den- 
sity as well so that the total Lagrangian density in (8.63) is invariant 
under the local gauge transformations in (8.64). This Lagrangian 
density is the starting point in the study of quantum chromodynam- 
ics (QCD) which describes strong interactions between fundamental 
hadronic particles. 


8.3 Quantum chromodynamics 


We have already noted (see (8.58)) that it is important to assume 
the group element g(x) to be unitary for applications in physics. 
Furthermore, since we also know that only compact groups can have 
nontrivial, finite dimensional unitary representation (see Peter-Weyl 
theorem in section 2.4), we must consider compact Lie groups for 
G. In this case, the corresponding Lie algebra is known to be re- 
ductive, namely, it must be at the most a direct sum of Abelian and 
semi-simple Lie algebras. Moreover in such a case (see, for example, 
(3.111)) there exists a representation p(t.) such that the metric 


Jab = Tr (p(ta)p(te)) = Goa; (8.66) 


is non-degenerate and its inverse g@ exists. To see this in some more 
detail, let us write any g(x) € G in the exponential form (see (8.7)) 


g(x) = e 8" eta) (8.67) 


for real local parameters €°(a). Since we would like g(x) to be unitary 
(see (8.58)), this determines the matrix generators to be Hermitian 
(which we have already assumed) 


(p(ta))* = p(ta), (8.68) 


As a result, the metric 


dav = Tr (p(ta)p(te)) = Tr ((p(ta)*p(te)) > 0, (8.69) 
is a positive definite second rank tensor in the sense that for any real 
constants c*,a = 1,2,--- , N (repeated indices are summed) 

gasce? > 0, (8.70) 


where equality in (8.70) holds only for c* = 0 for alla = 1,2,--- ,N. 
Since gqp is a Hermitian (in the space ab) and positive definite matrix, 
it can always be diagonalized through a proper choice of the basis tg 
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to be proportional to the identity matrix (this takes some simple 
algebra to show) 


Jab = C dap, (8.71) 


where C is a positive constant. In particular, this constant is chosen 
to be C = $ for the fundamental representation so that in this rep- 
resentation gap = 5 Sab as we have mentioned earlier (see discussion 
following (8.49)). Let us point out here that if Z is an Abelian alge- 
bra, then clearly the generator can be chosen to be a multiple of the 
identity matrix leading to a non-singular gap. 

With this background, let us next discuss the basics of quantum 
chromodynamics (QCD). It is an interacting gauge theory based on 
the compact gauge group G = SU(3). However, in order to avoid any 
possible confusion with the flavor SU(3) group (of fermions/quarks) 
to be considered in the next chapter, we will refer to this group as 
G = SU-.(3) and denote the flavor symmetry group as SU;(3). Here 
the subscript c refers to the three colors - red, blue and green - that 
the fundamental constituent fermions, known as quarks, can have. 
The quarks belong to the fundamental representation of SU.(3) (as 
we have mentioned before, fermions belong to the fundamental rep- 
resentation) which is three dimensional. On the other hand, as we 
have seen, gauge fields belong to the adjoint representation which, for 
SU,(3) (in general, for any SU(3)), is eight dimensional (the adjoint 
representation of SU(n) is (n? — 1) dimensional). Correspondingly, 
the gauge fields (potentials) are written as Ali(x),a =1,2,--- ,8 and 
are known as gluon fields. 

The three colored quarks are labelled by the index 7 = 1,2,3 
(1 for red, 2 for blue etc.) and are known to come in six differ- 
ent flavors: u (up), d (down), c (charm), s (strange), t (top) and b 
(bottom) and, of course, each flavor of quark comes in three colors 
so that we can label the quark fields correspondingly as u;(x), d;(x) 
and so on. In addition to the color quantum number, quarks also 
carry electric charge. In units of the magnitude of the charge of the 
electron, the charge carried by the u;(x),¢;(a) and the t;(x) quarks 
is 3 while that carried by the d;(x),s;(a) and the b;(a) quarks is 
—i. Quarks are the basic constituent of all hadronic matter and are 
different from leptons in that they interact strongly. (Leptons can 
have only weak and electromagnetic interactions while quarks par- 
ticipate in weak, electromagnetic as well as strong interactions.) If 
we denote the quarks collectively as qaji(x),a@ = 1,2,--- ,6 where 
Mi(L) = u(x), q2i(z) = di(x) and so on. Then we can write the 
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general Lagrangian density for the six quarks interacting with gluons 
(color gauge fields) as (repeated indices are summed) 


La = Ga,i(&) (Y" (@Opbij — (Ap )ig(@)) — Mediz) da,j(x), (8.72) 


where we have identified the gauge field in the representation of the 
quarks (fundamental representation) as 


(Ay) is (@) = Ap (2) (ta)ag s (8.73) 
with (t,);; representing the generators of SU,(3) in the fundamental 
representation. Furthermore, mq, a = 1,2,--- ,6 in (8.72) denote the 


bare masses (also known as the current quark masses) of the different 
flavors of quarks. 

The peculiarity of the strongly interacting theory called QCD is 
that the building blocks of matter, namely, the quarks, are confined 
in space. In other words, we cannot experimentally observe isolated 
quarks. Only their bound states belonging to the trivial representa- 
tion (color neutral) of the color group SU,(3) manifest as hadrons 
such as proton p, neutron n etc. which are observed in experiments. 
This is the reason why the three colors of quarks are termed red, blue 
and green since they can naturally combine to give white color which 
symbolically represents a color neutral state (trivial representation). 
For example, the three pi meson states (t+, 7°, 7~) with spin-parity 
J? = 0- (J,P correspond to spin and parity respectively) and iso- 
topic spin J = 1 can be described symbolically as color singlet bound 
states of the u;,d; quarks and their anti-particle states (u;,d;) of the 
forms 


— ujdj, no = Slt + didj), nw = djtj. (8.74) 
The spins of the constituent quarks must be anti-parallel in order for 
the bound state to have spin 0. This can, of course, be made more 
quantitative through the introduction of creation and annihilation 
operators for the quarks as well as states in a Hilbert space in a 
second quantized theory. But, we can avoid all these technicalities 
here and conventionally one says that 7+ = ud, just as we say that 
the deuterium is a bound state of a proton and neutron D = pn (we 
use D for deuterium to avoid confusion with the d quark). Similarly, 
the proton and the neutron can also be written as bound states of u 
(up) and d (down) quarks of the forms 


p= ejrujujdy, or, simply p= wud, 


N= Ejpusdjdy, or, simply n= udd, (8.75) 
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which are clearly color singlet states. Here €;;x,7, j,k = 1,2,3 de- 
notes the completely anti-symmetric Levi-Civita tensor in the three 
dimensional color space. 

The values of the bare masses mg,a = 1,2,--- ,6 of quarks ga 
are deduced from experiments to be around 3-5 MeV for the u (up) 
and the d (down) quarks while they are about 150 MeV for the s 
(strange) quark, 500 MeV for the c (charm) quark, 5 GeV (= 5000 
MeV) for the 6 (bottom) quark and 150 GeV for the ¢ (top) quark. 
The reason underlying the large spread in the values of the masses 
is not yet well understood theoretically. As a result, it is important 
to study different energy regimes (scales) separately. For very low 
energy experiments, say below 50 MeV, the contributions due to the 
s,c,b and t quarks are essentially negligible because their masses are 
much heavier. In this case, it is sufficient restrict the theory in (8.72) 
to the sector containing only the u,d quarks. Furthermore, ignoring 
the small masses of these two quarks, it is easy to see that the theory 
has a U(2) flavor symmetry (also written as U;(2)), namely, the 
Lagrangian density in (8.72) (with only the u,d quarks and masses 
vanishing) is invariant under a transformation of the form 


() ’ (" -u () | (8.76) 
where U denotes an arbitrary, global (space-time independent) 2 x 2 
unitary matrix. However, since the U(1) subgroup of U;-(2) is related 
to baryon number conservation, we restrict the matrix U in reality 
to SU;,(2) which is known as the isotopic spin group. It defines 
the isotopic spin J already introduced above and takes values J = 
0, 5 1, 3, +++ just as ordinary spin. In this sector (of u,d quarks), 
the only baryons are p (proton) and n (neutron) as we have already 
seen in (8.75) and they have I = 5 just like the u,d quarks. Nuclear 
matter is described as bound states of these nucleons p and n. For 
example, we can think of He® and H® as corresponding to the bound 
states He? = ppn and H? = pnn analogous to p = uud,n = udd. 

Next let us suppose that we are considering experiments at en- 
ergies below 500 MeV. In this case we may neglect the effects of the 
c,b and ¢ quarks thereby dealing with only the u,d and s quarks. 
Restricting the Lagrangian density (8.72) to only these three flavors 
of quarks and ignoring their mass differences, it is easy to check that 
the theory is invariant under transformations belonging to the SU;(3) 
group (actually, it is invariant under U(3) group, but we ignore the 
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U(1) subgroup as explained earlier), 


Uj U; Uji 
d; — di =U d; ; (8.77) 
3; si 34 


where U is an arbitrary global 3 x 3 matrix belonging to the group 
SU;,(3). This SU(3) group, which we will refer to hereafter as the 
flavor SU;,(3) group to distinguish it from the color SU,(3) group, is 
in reality violated by the mass differences between m,,mq and mg. 
Finally, we note that if we are considering extremely high energy 
experiments, say in the region of energy 1 TeV or larger, then we need 
to take all the six quarks into consideration. Furthermore, if we ignore 
all the masses (or mass differences), then the Lagrangian density 
(8.72) is easily seen to be invariant under global transformations of 
SU;(6) (or U(6)). However, it is worth pointing out that so far no 
such experiment exists (except possibly at LHC or at big bang). 
To be a little bit more specific, let us define the matrices 


(J*(@))"3 = Gaile) V9ei(@), T(x) = do(x)a8(2); (8.78) 


in the flavor space where a,§ = 1,2,---,6 and 2 = 1,2,3 with 
repeated indices summed. From the dynamical equations for the 
fermions following from (8.72), we can obtain 


On (JM (a))"g = (Ma — mg)TS (2), (8.79) 


for fixed a, G. It is clear from (8.79) that if ma = mg, namely, if the 
quark masses are flavor independent, then the flavor current matrix 
defined in (8.78) will be conserved, 


8, (J#(a))% = 0. (8.80) 


As a result, we can define a conserved flavor charge matrix 
ee [ee Cie) ee (8.81) 


which will be independent of x° (time). Furthermore, the equal-time 
(2° = y°) canonical anti-commutation relations for the quark fields 
give 

[ga,e(2), 9,5 (y+ = 0 = [ah «(x), ab s(wl+s 


[4a,i(2); a (W)]4 = 4085i;5°(a — y), (8.82) 
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where a, = 1,2,--- ,6, 7,7 =1,2,3 and the anti-commutator of two 
operators A, B is defined to be [A,B], = AB + BA. Furthermore, 
we have suppressed the spinor indices (of the Dirac fields as well 
as the anti-commutation relations) in (8.82) only to bring out their 
behavior in the internal space. With the help of (8.82) it can readily 
be checked that the charges satisfy the u(6) Lie algebra (see section 
30.9), 


By | = OX _ One) (8.83) 
where a, 0, y,0 = 1,2,--- ,6. We can also check that 
[X%, TY (2)] = 61 p(2) — OTS (2). (8.84) 


However, we will only be concerned with experiments in the in- 
termediate energies where only the u, d and the s quarks are relevant, 
namely, we will restrict the theory in (8.72) only to these three quark 
flavors so that the flavor indices a, 3 = 1,2,3. Since (a = 1,2,3 and 
repeated indices are summed) 


Bax. (8.85) 
corresponds to the quark or the baryon number (generator of the 
U(1) subgroup), it is more convenient to define the su;(3) sub Lie 


algebra by defining the (traceless) generators (charges) (see (3.52) as 
well as (3.54)) 


a a 1 a 
Q g = X%— 3 BB, (8.86) 
which satisfies 
[Q"s, | = 6Q"s — 5 '4Q%G: Os = 0, (8.87) 


for a, 0,y,0 = 1,2,3. Similarly, we can define 
OL (e4 [ea 1 (e4 
T(x) =T%, S%3(x) = TG(x) - 3 6°T (2), (8.88) 


so that T(x) and S%(x) would belong respectively to the singlet and 
the octet representations of SU;(3). With this notation, we can write 
the mass term in the Lagrangian density in (8.89) which violates the 
SU;(3) symmetry as 


Liat = (My +meqt+ms)T 


(2m, — my, — mg) S°3 (a), (8.89) 
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where we have identified my = my, M2 = mg, M3 = ms. We note 
that the SU;,(3) violating terms in £,, consist of only the octet 
terms involving mass differences (the singlet term is invariant un- 
der SU;(3)). As we will see in the next chapter, this leads to the 
SU;(3) mass formula for the baryons. The presence of the singlet 
terms in the Lagrangian density will not affect this discussion as we 
will see in the next chapter. 
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CHAPTER 9 


Quark model and SU;(3) symmetry 


9.1 SU; flavor symmetry 


As we have seen in the preceding chapter, the QCD Lagrangian den- 
sity (8.72), when restricted to the first three flavors of quarks u, d,s, 
is invariant under the flavor symmetry group SU;(3) if we can ignore 
the mass differences between m,,™mq and ms. In this case we can 
assign the u,d and s quarks to belong to the triplet (fundamental) 
representation of SU;(3) while all other quarks c,b and t as well as 
the gluons belong to the singlet representation of the group. Here we 
will discuss about the sector consisting only of the u,d and s quarks. 

First we note that according to QCD, all mesons (spin integer 
hadrons) can be built as color singlet states out of these three quarks 
and their anti-particles. Hence such an element (object) can be writ- 
ten as G,(x)qg(a),a, 8 = 1,2,3, where qq(x) denotes the ath quark 
feld. For example, qi(x) = u(x), qo(x) = d(x), q3(x%) = s(x). We have 
ignored the color index 7 = 1,2,3 of the quarks for simplicity (in a 
meson field, the constituent qq and q, fields have to have the same 7 
index or its sum for color neutrality). It follows now that the mesons 
bilinear in the quark anti-quark fields can either be singlets of SU;(3) 
of the form (repeated indices are summed) 


QalCiGale).. Hl, 2.5; (9.1) 


or belong to the octet representation of the form 


S%(0) =Tale)aae) — 56% (Qn), @6=123 (9.2) 


In contrast, we note that all baryons (spin half integer hadrons) 
consist of color singlet combinations of three quarks of the form 
do(x)ge(x)qy(x), a, 6,y = 1,2,3. As we have noted earlier, this ten- 
sor is not irreducible (see (5.94)). If we identify} | to represent the 3 
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dimensional (fundamental) representation of the SU;,(3) group, then 
the tensor Tag ~ daddy decomposes as (see (5.95)) 


| Jel jel j= e| |e ® , (9.3) 
|_| 


Algebraically we can write this also as 


38@383=10616868, (9.4) 


so that the baryons must belong to either the trivial (singlet) one 
dimensional or 10 dimensional (decouplet) or 8 dimensional (octet) 
representations of the SU;(3) group. 

Let us consider the octet representation denoted by Bag y which 
is specified by the Young tableaux symbol (2, 1,0) satisfying the con- 
straints 


Bapy = — Booey; 
Bapy + Bava + Byop =0, a,f,y=1,2,3. (9.5) 


Moreover we have already seen (see (5.81)) that any irreducible rep- 
resentation corresponding to the Young tableaux (fi, fo, fs) with 
fi > fe => fs is equivalent to that with (fi + A, fe + A, fs + A) 
for any integer A. Hence choosing A = —1, we see that the repre- 
sentation (2,1,0) is equivalent to (1,0,—1) which is described by a 
traceless tensor BY; a, 8 = 1,2,3 satisfying 


B°, =0, (9.6) 


just as in the bosonic case. The connection between the two is given 
explicitly by 


1 
B= 5 e Bua, 0B; v= 1,233, (9.7) 


as we have noted earlier also. Here «*” is the completely anti- 
symmetric Levi-Civita tensor. Since the use of B®; is more con- 
ventional for practical purposes, we use it hereafter to represent the 
octet of baryons, rather than the notation Byg, satisfying (9.5). 

Let us consider the Lie algebra su,-(3) of the SU;(3) group, given 
by the commutation relations between the generators (see (3.54) and 
(3.56)) 

[AY,, A%] = 0, AY + 57 AN, 0,8, u,v = 1, 2,3, (9.8) 


Vv 
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which also satisfy the traceless condition 
Al, = 0, (9.9) 


Furthermore the unitarity condition for the representation requires 
that the generators be Hermitian, namely, 


(AH,)T = A”. (9.10) 
If da(x) = (u(x), d(x), s(x)) for a = 1,2,3 represents the ath quark 
field, the creation operator, say for the u quark, corresponds to its 


conjugate field (q,(x))' = q'(x). Hence, we represent the basis vector 
for the qq(zx) field as |¢”), on which the generators A“, operate as 


1 
AM la") = 8% la") — 3 4 la"). (9.11) 


This simply describes how the vectors in the fundamental represen- 
tation transform under an infinitesimal SU;(3) transformation (see, 
for example, (3.64)). We can now readily verify that this operation 
is consistent with the su,(3) Lie algebra (9.8) 


LAS: A ]lq*) = AM,(A*|q°)) — A*, (A 19°) 
= Ar, (4a) — 55% la") ) — a4, (sla) — 55% 10) 
1 1 1 
— sa » BY _ > §H Xr — —6% §% |g) — —§# |q@ 
a, (sla) — porlar)) — 30%, (slat) — Sor la") 
SOU (05 Sse le ae eae se pla) 
3 3 3 
a cr (oy r 
= 6% 07 |g") — 6%, 0%, lq ) 
— 6H | 52 Xr 1 sa a 5 5% |gt! | su a 
eae} It) — 3 vig?) +07, elt) — 3 ACh 
2 (0, AN +8, A") Ig’). (9.12) 
Let us next consider the subgroup SU(2)@U (1) of SU;(3) corre- 
sponding to the standard isotopic spin and the hypercharge subgroups 
so that we can identify 
I,=A%, IL=A‘y, I3==>(A—- A’), 
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where (the three) Is denote the generators of the isotopic spin group 
SU(2) while Y corresponds to the generator of the hypercharge sub- 
group U(1). Using (9.8) we can now verify that these generators 
satisfy 


[feo J) OTe Tete) = ak, “(YEE = Ye iele 0.” . (9:14) 


Moreover, from the relation (9.11) it is clear that 


1 1 
I3|q') = 5 lq’), Islq?) = = lq’), Islq*) =0, (9.15) 
while 
1 1 1 2 1 2 3 2 3 
Yilq)= 3 la), Yl") = 3 la“), Yla’)= ~e lq”). (9.16) 


Recalling that gj = u,g2 = d and q3 = s, this implies that the 
(u,d) quarks belong to an isotopic spin doublet (J = 5) with Y = 5 
while the s quark is an isotopic spin singlet (J = 0) with Y = —3. 
Moreover according to the Gell-Mann-Nakano-Nishijima formula, the 
electric charge operator can be identified with (see (9.13)) 


Y 
Q=h4+—=A1, (9.17) 


Qlu) = Qla') = FI), Qld) = Qla?) = - 5a), 
Qls) = Qla°) = FIs). (9.18) 


Therefore, we conclude that the u,d and the s quarks carry the frac- 
tional electric charges 2,—3 and —4 respectively (in units of the 
magnitude of the electron charge). 


Similarly, for the eight baryonic state vectors |B°,), correspond- 
1+ 
Q°) 


ing to the octet of (traceless) baryon fields BS with J P= 3", we 
have (see, for example, (3.92)) 
A¥,|B%) = 6%, |Big) — 6 |BY). (9.19) 


Computing the actions of [3 and Y on such states we can identify the 
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states as 


3 
A=A°: T=0,Y =0, 1A") = -y/$15%), 
Y= (hte? B-)s 7 =1,Y =0, 
1 
V2 


The nontrivial normalization constants in the definition of states in 
(9.20) arise from the SU;(3) invariant normalization condition that 


J=*) = |B’), |E°) = >= |B — BY),|E7) =|B%). (9.20) 


1 
Bt B® = 6% ot ~ = of 6%3, a, B, pv = 1, 2,3, (9.21) 
Vy B vy’ B 3 vy’ B 


which also satisfies the traceless condition |B°,) = 0 (a@ is summed). 
With the normalization constants in (9.20) and using (9.21), we can 
readily verify that if we denote the octet of states collectively as 
|B;),7 =1,2,--- ,8, then they satisfy the orthonormality relation 


(B|B;) = 63, i,9 =1,2,---,8. (9.22) 


The electric charges carried by these states follow from (9.17) to be 
(already denoted explicitly in the specification of the states) 


id 1 
Qp aT 5 919 > Qn 919 , 
1. i.) 4 
s0 > er re = =>—_ -—- _ =-] 
Qe =5-5=0 Qs. =-5-5=-1, 
Qyo =0+0=0, (9.23) 
Qx+ =140=1, Qyo =040=0, Qy- =-14+0=-1. 


A flavor transformation belonging to the group SU;(3) of the 
form 


|B%) > |B%)! = a% af |B’), (9.24) 
induces a transformation on the states |B;) of the form 


|Bi) — |Bi)’ = Uis|B;), (9.25) 
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where Uj; represent 8 x 8 matrices and the orthonormality of the 
transformed states determines that the matrices U are unitary, namely, 


U'U = lg. (9.26) 


Let us now consider the branching rule for SU;(3) > U(2) ~ 
SU(2) ® U(1) in which the irreducible representation (1, f2, f3) de- 
composes as (see, for example, (5.123) and (5.124)) 


(fi fo, fs) =@ D> Oi, fa); (9.27) 
Sif 


for all (f{, f§) satisfying fi > f[ > fo > f > fs as we have already 
noted earlier. In this case, the sub-quantum numbers specified by the 
isotopic spin and J and the hypercharge Y can be identified with 


t=i(A-#), Y=A+h-2. (9.28) 


If we apply this to the original baryon field Bg, as described in 
(9.5) with (fi, fo, fg) = (2,1,0), then from the requirement f; > 
fi>fe> f= fs =2> fi >1> £5 = 0, we conclude that there are 
four combined possibilities for the values of f}, f5, namely, 


fe (9.29) 


Each of the four possibilities leads to the multiplets identified earlier, 
namely, 


(= 1: T=5,¥=1, asin (pn), 

(Fite) = 270)2 I=1,Y =0, asin (o*,d°, >>), 
(ey = ely I=0,Y =0, asin A®, 

(fo StL.0\¢ i= 00, as in (=",=°), (9.30) 


corresponding to the quantum numbers already discussed in (9.20). 
It is consistent to depict this octet as a hexagon diagram in the (I3, Y) 
plane as shown in Fig. 9.1. 

Similarly, consider the 10-dimensional irreducible tensor, de- 


scribing baryons with J? = sigas which decomposes as 


(3,0,0)=0 > (ff), 32> f, 20> f,>0, (9.31) 
fi .f3 
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Figure 9.1: Six of the octet of particles can be represented at the 
six corners of the hexagon in the Jz — Y plane with the other two 
naturally fitting into the origin with the right quantum numbers. 


which clearly determines f§ = 0 and fj = 3,2,1,0. The four solutions 
lead to the four multiplets (see (9.28)) 


Ge pra i30e f= a = 1, asin A= (A**, At, A®°,A-), 


Cet Se Pata HO ae Va, 
1 

Gps G04 T=5,Y=-l, asin e* — (5 =), 

Git = 00)? £Sa0.¥ ]—2> :asin O-: (9.32) 


We note from (9.28) that, since f§ = 0, in this case J and Y are 
related as 


Y 
[= Bi +1. (9.33) 
The decouplet of baryons can be accomodated on a triangle in the 
(13, Y) plane as shown in Fig. 9.2. We normalize the state vector of 
the decouplet |S°°7) for the totally symmetric tensor field Sapy as 


1 
VA| @aBy\ _ a sy B or 
(smrjgoer) — = > gnagregey, (9.34) 


permutations 


where the summation is over the (3!) permutations of the indices 
(a, 8, y) so as to maintain the total symmetric nature of the tensor 
state |S°°7). With this normalization, we can now identify the baryon 
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Figure 9.2: The decouplet of baryons can be arranged naturally on a 
triangle in the (J3, Y) plane with the right quantum numbers. 


states as 
1 
eae eee gt. Ries os gil2y NO eee gi22)_ 
Page” gh. Wee hari ten ae, ARE le 
es 1 
|A ee cae 

y*t gil3y yr) — 5123), aan oe 5223) 

)=e19™), Iv) = |8"%), fe) = 8%) 
= a =k 
Se abe ae = )= als), 
Q7) = ce (S729); (9.35) 


v6 


9.2 SU;(3) flavor symmetry breaking 


As we have noted earlier, if we ignore the mass differences between 
the (u,d,s) quarks, then the QCD Lagrangian density is invariant 
under the global flavor SU;,(3) group. In fact, all particles belonging 
to an irreducible representation of SU;(3) must have the same mass. 
Experimentally, however, the masses of the particles in the baryon 
octet, namely, p,n,=°,=7,A°,5+,°,- are slightly different from 
each other. The differences can be attributed to SU,(3) violating 
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mass terms in the Hamiltonian of the form 
H, = a(S", — $%) + BS*s, (9.36) 


where (see (8.89) as well as (9.2)) 
“3 = [es S%(x) 
= [ ax (daledaate) - 5 8% aNle\aale)), (9.37) 


and the coefficients a, 6 in (9.36) are given by (see (8.89)) 


a= 5 (my — ma), B= 


(2ms — Mu — Ma). (9.38) 
We note that S% in (9.37) satisfies the commutation relations (with 
the generators of the group) 


[Ay 83 = 07s 20 355, (9.39) 


which has the same form as the relation satisfied by the generators 
of the group (see (9.8)). We call any such tensor S% satisfying (9.39) 
as the adjoint tensor operator of the SU;(3) group. 

This is a generalization of the vector operators in the case of 
SU(2) group. For example, let us recall that the angular momentum 
Lie algebra for the SU(2) group is given by 


Be ug] > 1€:jkIk, 1,9; k= 1, 2,3, (9.40) 


with Jy = Jj tidy = A®,, J_ = Ji—iJp = At, and J3 = § (Al, — A) 
with A%,,a,b = 1,2 denoting matrix generators for SU(2). We know 
that any vector V = (Vj, V2, V3) satisfies the commutation relation 


Ba Vj] = 1€ijk Ve (9.41) 


which basically shows how any vector transforms under an infinitesi- 
mal rotation and coincides with the transformation of the generators 
in (9.40). In this case, the vector V is known as a vector opera- 
tor. Labelling the irreducible representations for states as |j,m) with 
m = j,j —1,---,-j + 1,— 9 for a fixed value of j (integer or half 
integer) we note that these states completely specify the vector space 
of the su(2) algebra. In this case, it is well known that we can write 


(j,m|V|j,m’) = a(9) (3, m|J|j,m’), (9.42) 
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where the value of the constant a depends on the vector V and j, but 
not on the azimuthal quantum number m. For simplicity, we write 
(9.42) symbolically as (even though it is not an expectation value) 


(V) = a(J). (9.43) 


An analogous theorem also exists for the su(3) Lie algebra. 
In this case, the main quantum number analogous to j (in su(2)) 
is (fi, fo, fg) while I,J3,Y correspond to the sub-quantum number 
m (of su(2)). For a given irreducible representation specified by 
(fi, fo, fz), the matrix element of the adjoint operator S® can be 
written (analogous to (9.43)) as 


1 
(9%) = a1(A%) + az(A°,A’, ae 6% AMA” ,), (9.44) 


which we will prove later. Here a, and a2 are constants which de- 
pend on the main quantum numbers (f1, fo, f3), but not on the sub- 
quantum numbers (J, 3, Y). Moreover, for any triangular irreducible 
representation which satisfies f; = fo or fo = fs we have 


a 1 a Vv Qa 


where 6 is a constant depending on the particular irreducible repre- 
sentation (fi, f2, fg). Therefore, using (9.44) and (9.45) we can write 
in a triangular irreducible representation, 


(5%) = ao (A%), (9.46) 


where ag is a constant. 

We are now in a position to explain the mass differences among 
the particles (states) in the octet and the decouplet representations. 
For this, we first ignore the small mass difference (my, — ma). This 
would imply the exact isotropic spin invariance so that masses of 
all the particles within a given isotopic spin J will be identical irre- 
spective of their J3 quantum number (projection). We can consider 
A, = 85°, in (9.36) as a small perturbation so that the mass of 
the particle with a given (1,Y) can now be computed for the gen- 
eral case as (the first term in (9.36) is proportional to [3, see (9.13), 
and, therefore, does not contribute to the mass difference within a 
multiplet) 


M(I,Y) = Mo + 6(S%s) + O(87) 


= My + 6 (aA) + aa(A8,a% — 5 AMAY,)) + O16"). (047) 
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In particular, for the triangular representation we can use (9.45) to 
write (ignoring higher order terms in 3) 


M(I,Y) = Mo + Ba(A®3) = Mo — Ba(Y), (9.48) 


where @ is a constant (whose value is not important for our discussion) 
and we have used the identification in (9.13). This leads to 


M(A) = Mo — 6a, M(Y*)=Mo, M(S*) = Mo + 8a, 
M(Q7) = Mo + 28a, (9.49) 


which yields the correct mass (difference) formula for the decouplet, 
nnamely, 


M(Q-) — M(&*) = M(&*) — M(¥*) = M(¥*) — M(A). (9.50) 


This mass (difference) formula for the decouplet agrees quite well 
with the experimental measurements of the masses of the baryons 
which leads to 


1672 — 1530 ~ 1530 — 1382 ~ 1382 — 1232, (9.51) 


where the masses of the baryons are expressed in terms of MeV (we 
are assuming c = 1). 
Let us now return to the general case and define 


Jy = AGA"; (9.52) 


which corresponds to the second order Casimir invariant of the su(3) 
Lie algebra. (We do not denote it by Iz as we have done earlier in 
order to avoid possible confusion with the isotopic spin generators.) 
Namely, it satisfies 


[A%, Je] = 0, (9.53) 


so that, in a given irreducible representation denoted by (fi, fa, fs), 
we will have a constant value for J» of the form 


Jo = (@+ 024+ @) -—2(& + 2+ 43) +1, (9.54) 
where 
f= fi+ 2, flo = fot, Lo Ta, (9.55) 


which we will show later. 
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We note that we can write the third term in (9.47) as 
AB, AY, — 2AM, AY 
A4* 3 3 v** Ub 
= A3, A, — = AM AY 42 AM AY 
— D cane ae 9 ve pe 6 ve 
1 . 1 
= A’, A’, i) (A*,A"s a A’, A”;) + 6 J2 (i, 3 =) 2) 


1 1 ; Pee Pe 1 
= 5434-5 (4.4%, + A‘ ,A’,) +h 


aS 
aN 
w 
w 
Nn” 
N 
| 


1 i La ae 1 


3 i perane ae | 
Aes AA ee 

D 3 D j it Gee 
3 
2 


x 


II 
Nl NILE wMIleHR 
PN 
a 
wo 
w 
ee 
ie) 
| 


eee 1 
Y¥ A’, A’. — Jo, 9.56 
97° 5 its 2 ( ) 


where we have used the commutation relations in (9.8), the traceless- 
ness of the generators A"), = 0 (see (9.9)) as well as the identifications 
in (9.13). Let us further simplify (9.56) by noting that 


= 5 Atal = -5 ((A4,) ae (A?,)? Ae ASA #. AtAS) 


1/1 iL 
=e (504 + A)? + 5 (At Moy ed le 11y) 


Se YS ll Te Ty, (9.57) 


where we have used the identifications in (9.13). Thus, substituting 
this into (9.56), we obtain 


1 Vv 
Ae AN = gt, 


1 3 1 
=—-Y?-P--V4 J. 9.58 
4 g° te? eee 
Noting that A®, = —Y (see (9.13)), we can write the formula in (9.47) 


as (taking expectation values in a state with a given (J, Y) quantum 
numbers) 


1 
M(I,Y) =Co+ CiY + Cy G y? -1(8+1)), (9.59) 
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where we can identify 


1 
Co = Mo + 6 Ba2Jo, 


Ci =—8 (a1 + 32] ; 
C2 = Baz. (9.60) 


For the triangular irreducible representation with f; = fo or fo = f3 
(see, for example, (9.33)), we have I = + +1 (and _ —I(I+1)= 
—3¥ _2) so that the general mass formula (9.59) for M(I, Y) reduces 
to the previous case (9.48), namely, 


M(I,Y)=Ch+ Cy. (9.61) 


For the baryon octet, the formula (9.59) gives (see also (9.20)) 


1 1/1 1 
M(W) = G+, +@ (3-5 (5 +1)] = Co mCi = Co 


B\9 
eae , aa ee ante 
M(=) =Co +e (5 5($+1)) =@ Ci 5 O2s 
M(A) =Co, 
M(X) = Co + C2 (0-11 + 1)) = Co — 2C2. (9.62) 


The constants Co,C , and C2 can be determined from the last two 
relations as well as a combination (difference) of the first two to yield 


Co = M(A), Cr= (M(N)—- M(&)), 


(M(A) — M(3)). (9.63) 


Furthermore, substituting these values for the constants into the or- 
thogonal combination (sum) of the first two relations in (9.62) leads 
to the mass formula 

M(N) + M(®) = = (38M(A)+ M(3)). (9.64) 
For the charge neutral members of the multiplets, relation (9.64) 
implies that 


M(n) + M(2®) = = (83M(A°) + M(2°)), (9.65) 


No] re 
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and this is quite well satisfied by the exerimental values for the masses 
(in MeV), namely, 


940 + 1315 ~ ~ (3 x 1116 + 1193). (9.66) 


1 
2 

Let us next consider the electromagnetic current operator in this 
sector (note the charge assignments for the quarks in (9.18)) 


Tem (&) = 5 (20) (w) 9" (@) — Go(@) 1" G2(a) — G3 (a)7"a3(@)) 


=e ((e)ran(a) ~ 5 tala)r"aa() ) 


SOV Gas (9.67) 
where we have defined the traceless current matrix 
Be Se Lee 
(V(x))"g = Ia(x)y"Ga(2) — 3° 8 dy (x)y¥"Gy(2), (9.68) 


with a, 8,7 = 1,2,3. This is an adjoint operator of the SU;(3) group 
and we emphasize here that we have ignored the contributions to the 
current from the c,b and the t quarks whose effects are expected 
to be very small. If we do add their contributions, the current will 
contain a SU,(3) singlet part in addition to the octet term, but we 
will continue working in the sector of the u,d and s quarks. 

Given the electromagnetic current operator, the magnetic mo- 
ment operator yu is defined by (the z component of the first moment) 


1 
15 [es Get dens) ay (9.69) 
so that we can write it as 


of the traceless adjoint operator 1% satisfying u%, = 0. So, we can 
calculate its eigenvalues (expectation values in specific eigenstates) 
much the same way as before (see (9.44)) and write 


1 Vv 
(Mt) = (wy) = a1(A4) + a2(At, AY — 3 AN), (9.71) 


where a1, @2 are two constants (different from those in (9.44)). For 
any triangular irreducible representation, following (9.47) and (9.48), 
we can again show that 


(u) = a4(A') = 24Q, (9.72) 
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where Q denotes the electric charge (of the state) and we have made 
use of the identification in (9.17) in the last step. This immediately 
leads to the relation 


w(AT*) = 2n(A*) = —2n(A~) = —2n(07), (9.73) 


among the members of the baryon decouplet. Unfortunately, only 07 
is semi-stable and only the value of j4(Q~) is known experimentally 
so that the relation (9.73) cannot yet be tested. 


For the J? = LY baryon octet, we can proceed as in the case 


of the mass formula (discussed earlier) and evaluate (jw). However, 
E 
Vv 


here we will use a simpler method to evaluate this. Let (B) os = 
be the anti-baryon tensor. Then, in a given octet representation we 
have 


2 flee: 
(u) = CB, BY - 5 BYB",) 
> pl i De pv 


for constants C1, C2. We note that (see normalization in (9.20)) 


1 
Bra 3 (BI, ras B*,) oye (Bi, = B’,) a B, (Bi, B*,) 
1 1 
aN ee 9.75 
G3 = Ja (9.75) 


so that we obtain from (9.74) 


1— 
p= (uh) = (Ci +2) (BBY, - 5 BBY) 


G8 5B 4B eB) CoB (By 4 BBs) 
ee ee | 1 
=(C1+0n)(((A AE +O) (Tao + BE) 


1 
5 (Dp +7n + AOA + OFEt + YON? 


+E-D7 + BE? +E-=°)) 


+C,(S-X7 + 2-57) + Cy (UtX* + pp)). (9.76) 
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Calculating this expectation value in definite baryon states we obtain 


w(p) = WE*) = Co— 5 (C1 +04), 


w(A°) = —w(B°) = 2 (C1 + C2). (9.7) 


Similarly calculating the matrix element in the states (5°| and |A°) 
states we obtain the transition magnetic moment in the decay ©° > 
A° + ¥ to be 

1 
2/3 
The Coleman-Glashow relations for the magnetic moments follow 
from (9.77) and (9.78), namely, 


pp( A? > >?) = (Cy + Cy). (9.78) 


w(ZT) + p(UM) = 2(Z°). (9.79) 
Experimentally, the values of the magnetic moments are known to be 


L(p) = 2.793un, p(n) = —-1.913 py, 
Dt) = 2.458+0.010uy, pu(Z-) = 1.160 + 0.025 py, 


( 
( 

p(B?) = -1.250+0.014yy, p(=”) = —0.6507 + 0.0025pn, 
( 


= 
> 

aod 
I 


—0.0613 + 0.004j1y,, 


|r (A° > D°)| = 1.61 + 0.08py, (9.80) 


where fry ~ 2.89 x 104 MeV/T (MeV/tesla) denotes the nuclear 
magneton. Equation (9.79) now determines 


p(X?) = = (u(E*) + w(Z7)) = 0.75 + 0.01 py. (9.81) 


The experimental values in (9.80) can now be checked to fit the 
relations in (9.79) fairly well except perhaps u(5~) = u(=_) and 
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u(n) = p(=°). However, we may note the following two points in 
this regard. First, the relation y(n) = 2u(A°) holds only when the 
electromagnetic current is purely an octet of SU;,(3). But, in reality 
the current contains a singlet term as well (as described earlier) of 
the form 


5 (2¢(x)y"c(x) — b(x)y"b(ax) + 2t(a)y"t(x)) . (9.82) 


Since the bare quark mass for the bottom quark is mp) ~ 1GeV, 
which is quite comparable to m, ~ 0.15GeV, we expect that the 
contribution of the singlet current to (A°) can be as much as 15%. 
Indeed if we assume that the electromagnetic current includes the 
SU;(3) singlet term, then we note (without going into details) that 
the relation y(A°) = 5u(n) will modify to 


sinOy = ; ((w(Et +57) + 4y(n)), (9.83) 


which improves the prediction over p(A°) = 5y(n). However, rela- 
tions such as p(~) = (EB), w(n) = (Z°) continue to hold shedding 
no light on larger violations. This suggests that we have to take into 
account the SU;,(3) violating contributions for these calculations. In 
such a case, the effective electromagnetic current will have a SU;(3) 
tensor structure of the form 713, in addition to T+, which leads to the 
relations 


— VBpur(A® + 3°) = 5(8m(A°) + (9) — (n(n) + H(E%)), 


wD) = S(u(B*) + ()), (9.84) 


The first of these two relations in (9.84) leads to pr(A? 4 %°) = 
—1.51y(n) improving the result compared with the earlier prediction 
of ur = —1.06p1(n) in the sense that it is now closer to the relation 
bir = —V3p(n) in (9.79). 

Although we cannot explain the large experimental violations 
of relations such as p(n) = p(E°) and (57) = p(Z-) (see (9.79)- 
(9.80)), we may argue as follows. Note that uy = Thtye the nuclear 
magneton, is the Bohr magneton of the nucleon. Since the magnetic 
moment is not dimensionless, we argue that the exact SU;(3) group 
theoretical relations should be applicable not directly to (B), where 
B denotes a baryon, but to a dimensionless quantity o(B) defined 
as 


eh 
= B 
IMac Ho(B), 


u(B) (9.85) 
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so that we have, for example, 


juo(r) = wo(E°) = 2u(A°),  uo(Z~) = wo (=). (9.86) 
This would lead to 
p(B) = ey p(n), => 125 =—-1.37, 
pe )= say p(n), = —0.61 = —0.80, 
(=")= T= 1S). S&S O65 ]—1,05, (9.87) 


which improves the results significantly over the predictions in (9.79) 
except for the last relation in (9.87). There is no clear idea regarding 
this discrepancy at the present time. 

The predictions of SU;,(3) symmetry are better for the weak lep- 
tonic decay modes of A°, Us and Es. However, since we have not dis- 
cussed about the electro-weak theory, we point out only the electro- 
magnetic self-energy effect in order to account for the mass difference 
between M(p) and M(n) as well as between M(=*) and M(~) etc. 
Note that the electromagnetic self-energy contains photon exchange 
diagrams, so that its SU,(3) tensor behaves like a complicated tensor 
Vii. Also since (m,—mqa)S} term (see, for example, (9.36)) can be 
written as (m,—mq)S}S1, both terms can be combined into a single 
tensor Tj. Therefore, the effective mass term has the form (see, for 
example, (9.47)) 


M = Mo + B(S3) + (T}?). (9.88) 


For the baryon octet, the same argument then shows that we may 
parameterize them as 


M = a, B°; B®, + a2B’, BY + a3B°3B, 
+ a481, BBY +0594 B"B!, +agB Bi, (9.89) 


in terms of six constants a1,@2,--- ,ag. Therefore, for the baryon 
octet, we can now have two mass relations of which one involves 
physically unobservable transition mass M;(A° + %°), so that. effec- 
tively there is one mass relation of interest. With some computation, 
it leads to the Coleman-Glashow mass relation given by 


M(S_) — M(8°) = M(=-) — M(=°)+ M(p)-— M(n), (9.90) 
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which compares quite well with experimental values (masses in MeV 
since c = 1) 


6.48MeV ~ 8.08MeV — 1.29MeV (= 6.73MeV). (9.91) 


For completeness we remark here that dynamical calculations of masses 
and magnetic moments have also been satisfactorily performed within 
the framework of lattice gauge theory. 

In ending this section, let us briefly mention 3 x 3 Gell-Mann 
matrices which are the SU(3) analogs of the 2 x 2 Pauli matrices, 


010 0 -i 0 1 0 0 
M=]1 0 0], A2=]i O OF, As=]O -1 O}], 
00 0 0 0 0: 02° 20 
001 0 0 -i 00:0 
M=1]0 0 0], As=]0 0 , AS*=]0 0 1], 
10 0 i 0 01 0 
1 

00 0 gee 0 
A= |0 0 -¢], A2=|9 Ze O |, (9.92) 

‘i 2 

0 2 O Os 0", 


which satisfy 
M=2;, Tr OiAj) = 26, 4,9 =1,2,3. (9.93) 


Introducing the real structure constants 
i 
figk = me Tr AgQrirj Aji) ) 
1 
p= i Tr (AgOAiAg + AGA) 5 (9.94) 


it is straightforward to verify that these structure constants are to- 

tally anti-symmetric and symmetric respectively under the interchange 
of any pair of indices 7, 7,k = 1,2,--- ,8. Moreover, it can be checked 

easily that the matrices in (9.92) satisfy respectively the commutation 

and the anti-commutation relations given by 


[ra Aj] = (Aid; =< Aji) = Qt faze Nk» 


4 
[Nis Aj]+ = (Aid; + Aj Ai) = 3 6: 1+ 2dijerAk- (9.95) 
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In particular, we note from the first relation in (9.95) that j,i = 
1,2,--- ,8 denote the 3 x 3 matrix representation of the generators 
of the SU(3) group. The explicit numerical values for the structure 
constants can be found in the book by M. Gell-Mann and Y. Neeman. 
Furthermore, if we define 


1 
Xi= 5 (Aa) ab A%, a,b = 1, 2,3, i= 1,2, aed 40s (9.96) 


then it follows, using (9.95), that 
[X;, Xj] = =[Cs)adA%, (Aj) ca Aa] 
(Ai) ab(Aj eal A% Aqal 


(Aa) ab(Aj ea (—O%GA%G + 5% A%a) 


ALR TR Ale 


1 Cc 1 a 
= 4 OApidoA’s + 7 OAs )aaANa 


1 Te a 
=A Dis Aslan A“ = A 2i fijk(Ak)abA% 
Sate GS (9.97) 


for a general matrix representation of X; . 


9.3 Some applications in nuclear physics 


The structure of a nucleus is generally modelled after the Fermi gas 
model or the nuclear shell model. As possible applications of group 
theory to nuclear physics, let us first consider Wigner’s U(4) multiplet 
theory. Suppose that the dominant nuclear force between nucleons is 
independent of the ordinary spin as well as the isotopic spin of the 
nucleons. Then the Hamiltonian of the system of interacting nucleons 
may be written in the form 


H=Ho+ Fu, (9.98) 


where Ho represents the part of the Hamiltonian of the system which 
is invariant under the U(4) group of transformations, 


wy =Ud, v=] 1, (9.99) 
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where U denotes an arbitrary space-time independent matrix belong- 
ing to U(4) and py, p|, n+, ny refer to the spin up and spin down states 
of the proton and the neutron respectively. On the other hand, the 
Hamiltonian H; may violate the U(4) symmetry and its contributions 
are assumed to be small compared to those of Ho. In such a case, 
we can classify the nuclear states by the irreducible representations 
of the U(4) group and the U(4) violating part of the Hamiltonian Hy 
can be treated as a perturbation as we have discussed in the case of 
SU,(3) symmetry of the quark model. Unfortunately, such a U(4) 
model does not turn out to be realistic (does not agree with observa- 
tions) and, therefore, we will not go into the details of this model. 

As another nuclear model, let us consider the shell model in 
which we have n singlet particle levels (or shells) with the energy 
of the level €,,4 = 1,2,---,n. Let us assume that the pth shell 
is further labelled by another micro quantum number p such as the 
angular momentum or spin although we will not specify here what 
p represents. Let a, with up = 1,2,---,n,p = 1,2,--- ,& denote 
the annihilation operator for a nucleon in the state specified by the 
quantum number (j,p) and that the Hamiltonian for the system is 
given by (repeated indices are summed) 


= Gal Gi + Creat) tiie (9.100) 


where a, 8, u,v = 1,2,--- ,n while p,q = 1,2,--- ,¢. (Here Gop isa 
constant tensor.) If we now identify 


X% = ah p89 (9.101) 
then the Hamiltonian in (9.100) can be rewritten as 

Se og ee: (9.102) 
Moreover, since dq,» and al; . satisfy the anti-commutation relations 

Qa,p4B,q + A6,q@a,p = 0 = aNsahs + eee 

dap , +a), ap = Sap5pqs (9.103) 


it follows that X% satisfy the commutation relations of the u(n) Lie 
algebra (see (3.50)) 


[xh XG] = OE XS, + OU Xs, (9.104) 
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as well as the unitarity condition (see (9.101)) 
t 
(xt)! = (4), par.) = ee = X"),. (9.105) 


Here p,v,a, 0 =1,2,--- ,n. 

Let us note that the Hamiltonian H does not commute with 
X*, in general so that it is not invariant under the U(n) group of 
transformations. However, it does commute with all the Casimir 
operators J, = 1,2,--- ,n of the u(n) Lie algebra given by 


Like 


Oe RHI ie, RR Re (9.106) 


and so on. In fact, if we think of X", as matrices, then the Casimir 
operators can be written as 


Jog = Tr X%, (9.107) 
and it follows from the commutation relations (9.104) that 


[Ja, X*] = ise, Gare. 24 


Vv 


Came ec nC sama 
(xermy* (5, XH + Ox!) (XM), 


Gee ee Cerny ss | 


EYP l= 0, (9.108) 


which shows that these are indeed the Casimir operators for the Lie 
algebra of u(n). Furthermore, using (9.108) it follows immediately 
that 


[Ja, H] = 0, (9.109) 


and (Ja, H) define a set of commuting operators which can be diag- 
onalized simultaneously. 

In other words, we can classify the eigenstates of H by the ir- 
reducible representations of the u(n) Lie algebra. Since any d x d 
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irreducible matrix representation of the u(n) Lie algebra can be read- 
ily constructed, we can express H as a d x d matrix which can then 
be diagonalized to determine its energy eigenvalues. (Note that any 
irreducible d x d matrix representation of u(n) can be specified by n 
eigenvalues corresponding to the n Casimir invariants.) As a simple 
illustration, let us make the (rather drastic) assumption that all the 
eigenvalues are degenrate, 


j= Q ==: =e = €, 
gh = 916453 + 926564 + 936" bag, (9.110) 


where gi, 92,93 are constants. With these, the Hamiltonian (9.102) 
takes the form 


H = €X", + (g16K63 + 92550% + 936" bag) X9,X"%, 
=i + (Ji)? + gode + g3X" XM, 
= Fo + 93X4,X", (9.111) 
where we have identified, for simplicity, 
Eo = €J, + gi(J1)? + goJe. (9.112) 
Furthermore, let us note that if we define 
Tig A To. (9.113) 


then using (9.104) it is straightforward to verify that it satisfies the 
Lie algebra of so(n) and, therefore, corresponds to the generators of 
the infinitesimal rotations in n-dimensional space. It follows from 
(9.113) that 


Dad XY MR 2) 
=2 (xe ae Cee) ’ 
1 

or, X'4X4 = Jot 5 lide) pints (9.114) 
As a result, we can write the Hamiltonian (9.111) also as 

| ene (9.115) 

—~ “40 2 YY LV» ‘ 

where we have identified 


Eo = Eo + 932. (9.116) 
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We note that 
1 
C2 —= 5) Juv dus (9.117) 


corresponds to the quadratic Casimir invariant of the so(n) sub-Lie 
algebra of u(n) and commutes with both J; and Jz. As a result, the 
energy eigenvalues of H are now completely determined by the values 
of Jj, Jo and Cy. However, these models are not realistic enough. 
Perhaps the most successful nuclear model is based on the interacting 
boson model where the nucleus consists of nucleon pairs tightly bound 
to each other behaving effectively like bosons. As a result, the nucleus 
(with an even number of nucleons) may be regarded as a boson gas 
(see Iachello et al.). 

Another interesting topic in nuclear physics is known as the 
nuclear-boson expansion theory in which we can express products of 
two fermion operators such as a,,,)@),q a8 a polynomial of some boson 
operators to justify replacing fermionic systems by bosonic ones. One 
of the related ideas in this regard is the Holstein-Primakoff represen- 
tation of the su(2) Lie algebra. Let us write f = 27 = 0,1,2,3,--- 
where j denotes the angular momentum quantum number charac- 
terizing a state. Let a and a! denote two bosonic annihilation and 
creation operators satisfying the commutation relation 


[a,a'] = 1. (9.118) 
If we now define 

Jy = I + ido =al(f —ata)?, 

J_-= J, -iJg =(f - ala)2a, 

ye eee (9.119) 


then it can be directly checked that they satisfy the su(2) Lie algebra 
with 


Pata ee Soa 


1 
2 
a vi 

=5 € p i) ; (9.120) 


This clearly works if we restrict ourselves to the sector f > ala > 0. 
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CHAPTER 10 


Casimir invariants and adjoint operators 


To start with let us consider, as an example, the u(NV) Lie algebra 
given by 


[x X%3] = 0 XS ++ OVA 6 (10.1) 
for u,v,a,0 = 1,2,---,N. In the universal enveloping algebra of 


u(N) or in some matrix representation of u(N), then the N x N 
traceless matrix S“; defined by (see section 3.2.3) 


1 
B= T3- Hy 


will also satisfy the commutation relation 


oe (10.2) 


[X4,, S%] = —6%S% + OV S"s, (10.3) 
where 
[x*,, T%] = 0,1") + a hae (10.4) 


We have called such an operator an adjoint operator (sometimes also 
known as a vector operator) in the earlier chapter. This notion can 
be generalized to any Lie algebra L satisfying (we are assuming that 
the generators X, are Hermitian) 


Kay Xp] S1C Xa (10.5) 


where a, $8,y = 1,2,--- ,M and M = Dimension of the Lie algebra 
L. For example, for u(N), we know that M = N?. Furthermore, we 
recall that 


Os has nfl 
Cog = Chan 


are known as the structure constants of the algebra. In this case, any 
operator Sq satisfying 


[Xa, 5p] = 107,85, (10.7) 


(10.6) 
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is known as a covariant adjoint operator of the Lie algebra L. Ina sim- 
ilar manner, we can define a contravariant adjoint operator through 
the relation 


Xe) = 702 2, (10.8) 
It now follows that any quantity of the form 
L=Hal). or, LSP" Sa; (10.9) 
will satisfy (say, for the first form in (10.9)) 
[Xs 1] = (Xo, SeT"| = [Xa SelT? + SalXa, TP] 
= iC} 4ST" — iC8,SgT? = iC1,(S,T? — SyT*) 
=0. (10.10) 


Here we have renamed the indices in the second term in the inter- 
mediate step. This shows that the generators of the Lie algebra will 
commute with quantities of the forms in (10.9) and, consequently, 
such quantities define Casimir invariants of the Lie algebra L. 

Let us assume from now on that there exists a symmetric (second 
rank) constant tensor (metric) gag = Ysa (see (3.111) and (3.115)) 
which satisfies 


Ges Cay fay (10.11) 


with fxg, denoting the completely anti-symmetric structure con- 
stants of the Lie algebra. Furthermore, we assume that the metric is 
nonsingular so that the inverse g’ exists and satisfies 


Gayg? = 68. (10.12) 


As we have noted earlier, these conditions hold for any reductive 
Lie algebra, namely, the direct sum of semi-simple and Abelian Lie 
algebras. For such a case, given a covariant adjoint operator Sy, we 
can always obtain a contravariant adjoint operator S® as 


Se ga: (10.13) 


As a result, the Casimir invariant (10.9) of the Lie algebra L can now 
be written as 


P= Sel? SG? Sats: (10.14) 
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In particular, if T° = S°, this takes the form 
LS = Go" $.8e—Se8" = 8° Se (10.15) 


and would correspond to a quadratic Casimir operator for Sg = Xq. 
In general, if we can write (g°°2""°?-! is a constant tensor which is 
a generalization of the metric tensor) 


Te = Gree eT Nas see me, (10.16) 
then the Casimir invariant in (10.9) with Sg = Xq can be written as 
Ip = Ge “Oe XN Ss aa Xap; (10.17) 


would define a pth order Casimir invariant of the Lie algebra. 

To be specific, let us assume that we are considering an irre- 
ducible matrix representation space p of the Lie algebra L. Then, it 
will be shown below that since the Casimir invariants commute with 
all the generators of the group, they must be constant multiples of 
the unit (identity) matrix 1, namely, 


I=1(p)1, (10.18) 


where the constant I(p) corresponds to the eigenvalue of J in the 
(representation) vector space V = p. To see this more directly, let us 
look at the eigenvalue equation 


19 = X0, (10.19) 
which has a nontrivial solution if the characteristic equation 
det (I — \1) = 0, (10.20) 


is satisfied. Furthermore, since J commutes with all the generators 
(see (10.10)), it follows that 


Xealle=OGl = 16 <6: 
or, I(Xqv) =A(Xaqv). (10.21) 


More generally, if P(Xq) denotes an arbitrary function of the gener- 
ators Xq, it would follow from (10.21) that 


I (P(Xa)v) = \(P(Xq)v) - (10.22) 


This would imply that if v is an eigenstate of J with eigenvalue 4, 
then (P(Xq)v) is also an eigenstate of J with the same eigenvalue 
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for any arbitrary function P(X.). Let Vo denote the subspace of the 
vector space V spanned by vectors of the form P(Xq)v. This would 
imply, in particular, that XgP(Xa)v C Vo (since we can think of 
P'(Xq) = XqP(Xq)). Namely, Vo would correspond to an invariant 
subspace of V. On the other hand, by assumption V is an irreducible 
vector space and if Vo contains v ~ 0, we must have Vo = V so that 
any vector v € V is an eigenvector of J with the same eigenvalue 
which implies that 


I=Al1. (10.23) 


Denoting A = I(p) we obtain (10.18). 


10.1 Computation of the Casimir invariant /(,) 


Let {po} denote the generic irreducible representation of the Lie alge- 
bra L while {9} represent a fixed irreducible representation which 
we call a reference representation. We will choose {9} to be the sim- 
plest, namely, the lowest dimensional irreducible representation for 
a reason to be explained shortly. Let Xq and xq denote the matrix 
representations of the generators in {p} and {po} respectively. Let 
us consider the tensor product {p} ® {po} and define 


Yo = Xq @19+18 2a, (10.24) 


where 1 and 1 represent the identity matrices in {p} and {po} re- 
spectively. We note that 


[Yo, Ye] = [Xo @ lo +18 rq, Xp 8 19 +1 zg] 
= [Xq ® lo, Xg ® 1p] + [Xa ® 1p, 1 ® x6] 
+ [1 @ tq, Xg @ 1p] + [1 @ rq, 1 @ zl. (10.25) 
The individual commutators are easy to work out. 
[Xa @ lo, Xp @ lo] = (XaXg) ® (Lolo) — (XeXa) ® (Lolo) 
= ([Xa; X]) @lo= iC g (Xy ® lo) ) 
[Xq ® 1o,1® xg] = (Xq1) @ (1oxg) — (1X4) ® (x10) 
= Xq ® 2g — Xq® rg = 0, 
[1 ® tq, Xg @ 1p] = (1Xg) ® (alo) — (Xg1) ® (talo) 
= Xg 8% — Xg@la = 0, 
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[1 ® xq, 1 ® xg] = (1 x 1) @ (taxg) — (1 x 1) ® (wgxq) 
=1@ [t,x] = iC}, (1 @ 24). (10.26) 


Here we have used the commutation relations in (10.6) and using 
these in (10.25) we obtain 


[Yo, Ya] = iC], (Xy @ lo +1 @ zy) =1C},Y,. (10.27) 


Therefore, Yq in (10.24) also satisfies the same algebra as the gener- 
ators in (10.6). 

As we know we can raise the indices by applying g°? so that we 
can write 


» 9°? Xp, c= 9° xe, (10.28) 
and so on. This allows us to define the operator 
C= Xp. 82" =X? Oa, = 9 Xn O Tu: (10.29) 
which has no indices. We can now calculate 
[Ya, Q] = [Xa @ lo +1®@ za, Xp @ x*] 
= |Xo;X4| 8 ah + Xp ® [ta, 2°] 
= (iC2,Xy) ® x? + Xg® (-iC8,27) 
= iY, (X, 20° - X, @2") =0, (10.30) 
where we have used (10.9). In particular, if 


Q? = QQ---Q, (10.31) 
—SS—4J/ 


p times 
denotes the pth matrix product of Qs, we have from (10.30) 
[Yo,@?] =0, p=1,2,---. (10.32) 
This can be rewritten as 
[Xq @ lp +18 rq, XB, Xp, ° --Xg,® Pig? . .. gPP| =0, 
or, [Xo,Xg,Xg,---Xg,] @ 2a -.- oF 


+ X,Xg.---Xg, @ [za, 2a --- 2] = 0. (10.33) 
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We can now take a partial trace Tr,, of both sides of the equation only 
over the reference representation pg (and not over the representation 
space p). Then the second term in (10.33) will vanish (the trace of a 
commutator vanishes) and defining 


A bee Cage - 26>) = gh P2"Pe (10.34) 
the partial trace yields 
[Xan ght2"Bo Xp, X pq ---Xp,] =0, (10.35) 


for any Xq € L. In other words, we can identify the pth order Casimir 
invariant of the Lie algebra L with 


is = gree ep Nas sia Xap; (10.36) 
so that we can write 
Ip = Ip(p) 1, (10.37) 


for all possible eigenvalues J,,(p) of the Casimir invariant. 
Furthermore, since by definition 


GQ? = Bay Nay Rae Oa ee ee, (10.38) 
the complete trace (trace over both the spaces) leads to 


MQ? =} Te (Neha hea Des ee es ae) 

= gM Pr Te, (Xa, Keg XG 

ST Ge ee Dageae, 

= Trply = Ip(p) Tep 1 = d(p)Ip(0), (10.39) 

where d(p) = Tr, 1 denotes the dimension of the irreducible repre- 


sentation space {9}. Here we have used the definitions (10.34) as well 
as (10.36) in the intermediate steps. It follows now that we can write 


aa es 
~ d(p) 


and, therefore, to determine the Casimir invariant, we only need to 
calculate Tr Q?. 

This can be done as follows. Let us assume that LD is a reductive 
Lie algebra and hence {p}®{po} can be decomposed into a direct sum 


Ip(p) Tr Q?, (10.40) 


10.1 COMPUTATION OF THE CASIMIR INVARIANT I(p) 251 


of n irreducible representations {p;},j7 = 1,2,--- ,n as (see section 
2.4) 


{0} @ {oo} = >> {3}. (10.41) 
j=l 


In this case, we can write 


Y, = Xq@1p+18 ro 


See a (10.42) 
j=l 


where x¥ ) is a function of Xq while P; (which contains the rq de- 
pendence) denotes the projection operator onto the space {p;} so 
that 


n 
SOP; =18 lp. (10.43) 
j=l 


In other words, P; can be written in the block diagonal form (Ej = 
1;) 


P= (10.44) 


0 


For example, denoting the second order Casimir invariant by Io, 
we have 


gPXOXD = Io(p;) at for fixed j = 1,2,--+ ,n, 
g°? X.Xp = Ie(p) 1, 
aB 


9" Laxp = I2(po) Lo, (10.45) 
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so that we can rewrite (see (10.29) as well as (10.42)) 
Q =, ge? Xe & LB 


1 


= 5 D209) Pj — 5 (Lalo) + 12(0)) 1.8 Bo 


1 
= 5 S5 (he(p;) — Lop) — I2(p0)) Pj, (10.46) 
where we have used (10.43). For simplicity, let us denote 


& = 5 (ale) — Lalo) — La(p0)) (10.47) 


so that we can write 
n 
Q= SGP. (10.48) 
j=l 
It follows now that (remember that P; is a projection operator) 
n 
Q? = pa (€;)? Pj, (10.49) 
j=l 


and the trace leads to (see (10.40)) 


TOP SS (Gy ie =) Ga, 
j=l j=l 


or Wyle) = Fy BO = YG) Alo. (10.50) 


Therefore, if we know the Clebsch-Gordan decomposition law for 
{p} ® {po} = ay @{p;}, we can calculate the pth order Casimir 
invariant [,(~) for any generic irreducible representation {p} since 
the eigenvalues [2(p;) of the second order Casimir invariant can be 
easily computed. 

As an illustration, let us compute J,,(p) for the Lie algebra u(V). 
In this case the dimensionality of the group is given by M = N?. Let 
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the generic irreducible representation {p} be labelled by the Young 
tableaux symbol 


{oe} = (fir fay fw), fie fe2---> fn, (10.51) 


and we choose the reference representation to correspond to (the fun- 
damenta representation) 


{po} =|] = (1,0,0,--- ,0). (10.52) 


In this case, we can decompose the product of representations {p} ® 
{po} as (see section 5.4) 


{e} ® {po} = (fi, fa,--- fn) @ (1,0,0,--- <0) 
= (fi +1, fo,..., fw) ® (fi, fa +1, fa,-°> fn) 


Coe (ig pine ds (10.53) 
If f; < fj41 +1, we can drop the representation 
Chi doe aide PL ia (10.54) 


since it violates the condition required for a Young tableaux (see, for 
example, (10.51)). 

Furthermore, Weyl’s dimensional formula (see, for example, the 
book, ”Group theory and quantum mechanics”, by H. Weyl) gives 


1 
d(p) = ana). =D) 1G — ly). (10.55) 


where ¢; = f; +N —j,j = 1,2,---,N. Note that by construction 
t) > lg >--- > ly. Moreover, we can calculate I2(p) as 


N 
Too) = 92 Gj(6) —N +1) + : N(N —1)(N —2), (10.56) 
j=l 


which we will show later. Then, €; can be computed using (10.47) 
and hence J,(p) from (10.50). We can, therefore, determine the pth 
order Casimir invariant to be 


N Ne Fi 
I,(p) = £;\P 1+ 
@= LO) II'( —z) 
N N 
= ye? pl+& — (10.57) 


254 10 CASIMIR INVARIANTS AND ADJOINT OPERATORS 


where the prime implies excluding the term 7 = k in the product. 

In spite of its fractional representation in (10.57), Ip(p) is really a 
pth order symmetric polynomial in ¢1, €2,---@y. In order to show this, 
let us assume that z is a complex variable and consider an analytic 
function 


N N 
fe=]] (1 a =) =|] qo (10.58) 


k=1 


N 
Res f(z = ¢;) =—]] (10.59) 


and f(z) + 1 for z + oo. Therefore, we can rewrite f(z) also as 


1 ta Dire x (10.60) 


j=l 


with the coefficients determined from the residues to be (see (10.59)) 


a +h G 
b = ||] ——. (10.61) 


N : co ON .\P ph. 
f(z) =1 ee! Se 


(oe) 
=1]- Ss" oe (10.62) 


where we have used the relations in (10.57) and (10.61). On the other 
hand, rewriting the function f(z) in (10.58) near z — oo as 


N ioe) 
fi) =] (1 = x) =]J{1->> ny (10.63) 


and comparing with (10.62) we can now compute I,(p) for lower 


10.1 COMPUTATION OF THE CASIMIR INVARIANT I(p) 255 


orders as 


iw N 
nio)= o- (4) =fitfot---+fn, 
=i 


N 
Th(p) = So fale — N +1) + io): 


k=1 
N N N js 
(0) = (4) - “—* Ste)? - 5 (> ts) 
k=1 k=1 k=1 
N 
-- 5 NN -1)S04&- ) (10.64) 


k=1 


and so on, where we have identified 


(;:) = raat (10.65) 


with the binomial coefficients. 

We can rewrite the Casimir invariants J, in terms of the gener- 
ators of the group X‘, as follows. Let us write x%,a = 1,2,---,M= 
N? as x%, where a,b = 1,2,--- ,N. Thought of as N x N matrices, 
its elements are given by 


(a) ep Ce Vos i Opa. Os Oy Chel, tae aN, (10.66) 
which leads to 
(0%) og = (2 a) be (2a)eg = 0°,0 90g = 6% (2a) nq . (10.67) 


It follows now that these indeed satisfy the commutation relations 
(10.1) of the Lie algebra, namely, 


[x%,, 2° 4] = —d°% x" + 650%). (10.68) 
Furthermore, using (10.67) we can calculate the metric 

gS Te (ea?) = Tr (2% 2°,) = 07,05, (10.69) 
as well as 


gr = Te (0% 20%) = SOR, (10.70 
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As a result, we have (see discussions around (10.29)-(10.37)) 
Ty = Tr (2%) X% = 64X", = X%, 
Tn = gg Xq Xp = 56 XX = X4X 
Ts = g°?"XqXpXy = 04,05, OR X a1 X aX ay 
= X4X? Xe, (10.71) 


and so on. We can similarly compute eigenvalues of higher order 
Casimir operators for the Lie algebras so(N), sp(2N), G2 (for details, 
see S. Okubo, J. Math. Phys. 18, 2382 (1977)). In the next sec- 
tion, we will enumerate all adjoint operators in a given irreducible 
representation which has been solved in the above mentioned paper 
and has been utilized in the derivation of the SU,(3) mass formula 
discussed in the previous chapter. 


10.2 Symmetric Casimir invariants 


One problem with the Casimir invariants I, of the u(N) Lie algebra 
discussed in the last section is that they are not unique. For example, 
let us consider the second order Casimir invariant Jy. It is clear that 
we can always define a family of second order (in the generators) 
invariants of the form 


Ih =In+c(h)’, (10.72) 


where c is an arbitrary constant. The nonuniqueness in the defintion 
becomes much more serious for higher order Casimir invariants. For 
I3, for example, we can have a two parameter family of invariants of 
the form 


$—Ip+c(h)? +ohh, (10.73) 


for arbitrary constants c; and cg. 

This question of nonuniqueness can be easily fixed if we consider 
the su(N) sub-Lie algebra of u(N). More generally, let ZL denote a 
simple (or, in general, semi-simple) Lie algebra. Then we can show 
that it does not have any first order Casimir invariant in the following 
manner. Suppose that we have a first order Casimir invariant 


Ty = £°Xg, (10.74) 
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for constants €°%,a@ = 1,2,---,M where M = dimL denotes the 
dimension of the Lie algebra. Being an invariant, J; commutes with 
all the generators (see (3.103) for the definition of the adjoint action) 


ix el =, a=1,2,---,M, 
or, ad(lh)Xq =0. (10.75) 


Using (10.6) and (10.75), it follows that for any a, 3 =1,2,---,M 


[1 [Xa, Xe] = 0, 
or, ad(I))ad(Xq)Xg = 0, 
or, ad(Iy)ad(Xq) =0, 
or, Trad(Xq)ad() = 0. (10.76) 


On the other hand, by definition (10.74), 

ad (I,) = ad (Xa), (10.77) 
so that (10.76) leads to 

€°Tr ad (Xq)ad(Xg) = 0. (10.78) 


Since Xq belongs to an arbitrary representation, we can choose it in 
the adjoint representation to obtain 


Gopt” = 0, (10.79) 


with ggg = Tr(ad(X,)ad(Xg)) (see (3.111)), which implies that 
€* = 0,a = 1,2,---,M since for any semi-simple Lie algebra the 
metric is nonsingular (has an inverse). This proves that J; = 0 for 
any semi-simple Lie algebra. We will show shortly (see also (3.111)) 
that for any irreducible representation {p} of V we have 


d(p) 
(Pad; ) 


where d(p) denotes the dimensionality of the irreducible representa- 
tion {p} while d(paqj) = M corresponds to the dimensionality of the 
adjoint representation of L. As a result, the types of nonuniqueness 
problems associated with the u(NV) Lie algebra described above do 
not arise in any semi-simple Lie algebra simply because J; = 0. How- 
ever, we note here that even for the SU(N) group, if we consider the 


Tr X,Xy = 13(p) ga (10.80) 
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fourth order Casimir invariant I4(p), we can always add to it lower 
order invariants of the form 


Ih(p) = La(p) +e (L2(0))?, (10.81) 


for an arbitrary constant c. I/(p) will also denote a fourth order 
Casimir invariant and we will discuss below how such non uniqueness 
can be removed from the definition of higher order Casimir invariants 
in a systematic manner. 

There may be other types of ambiguities present in the definition 
of higher order Casimir invariants. Let us consider the third order 
Casimir invariant given by 

Iz = gPIX,XpXy, g°% = Tr (ea? x), (10.82) 
where by cyclicity of the trace we have 

gr? = gh = gee: (10.83) 


Let us decompose g®*7 into totally symmetric and totally anti-symmetric 
parts as 


gery — Arey 4 ‘mae (10.84) 


where (using (10.82) it is easy to see that they are respectively totally 
symmetric and anti-symmetric in all the indices) 


posby — ; (0 Ze 9°) = si ((222" + nen!) 


1 


1 
for = 5 (00 — 9°) = 6 Tr ((2%2" — x°a*)x") . (10.85) 


Therefore, the third order invariant in (10.82) can be decomposed as 
Tg de ee (10.86) 
where 
TSX Np doy 
De fon KR (10.87) 


Moreover, using the anti-symmetry property of f°’, we can reduce 
a into a lower order Casimir invariant as follows 


1 
1 = FORUM Xe, _ 5 fh Xa, Xp] Xy 
1 ; 
=5 foby (iC35Xs) Xe 


i 
= 59" ( per fast) Kp Ky (10.88) 
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Furthermore, using the symmetry property as well as the definition 
of the structure constants we note that 


fapr = frag = Crago = (ad (ar))2 G6 = (ad (2r))ag, (10.89) 
and, similarly, 
fer a aft Sade)" (10.90) 
which leads to 
f°? fore = — (ad (#7))?* (ad (27) gg = —Tr (ad (x)ad (x,)) 
= —Ip(pagj) 5, (10.91) 


With this we can now write (10.88) as 
I; = 5 Tx(Paaj) 9° 51 X5Xy 


1 1 
eee To(paaj) 9°" XsX7 = “5 To(pag)L2, (10.92) 


so that we can write 
_ae= 5 12(Paay)I2- (10.93) 


Clearly the lower order invariants in (10.93) can be eliminated 
if we define the Casimir invariant in terms of the symmetrized struc- 
ture h°97, However, even when we do use a symmetrized structure 
g°!°2"" in the definition of the Casimir invariants, there may still be 
nonuniqueness in the definition at higher orders. For example, for 
the fourth order Casimir invariant we have 


Ta GPP XX aX ka; GP? = Tr ea ce"); (10.94) 


As before, it can be shown that this invariant contains lower order 
invariants as in (10.93). Introducing the completely symmetric struc- 
ture 


pobre — a Dg, (10.95) 


where “P” denotes summing over all possible (4!) possible permuta- 
tions of the indices a, 8,7y,6 and defining as before 


Le SOO AX Ae (10.96) 
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it is clear that the lower order terms coming from the anti-symmetric 
parts of the structure g%°7 will not be present in (10.96). However, 
we note that 


Robe ep Pe = Arr + ¢( gA gr? + g%7%gF5 + gree), (10.97) 


is also a completely symmetric structure for any arbitrary constant c 
(namely, h2 is not unique) and would generate lower order Casimir 
invariants in the definition of I ; This ambiguity can be avoided by 
requiring that the symmetric structure satisfies 


= 6 
ne”? gaggy5 = 0, (10.98) 


so that the lower order invariants will not be present in the definition 
of I4. Therefore, we conclude that we can define J, with a symmetric 
structure 


Iy= 1) = 7° 


XqXpX+1X65, (10.99) 
where ra is completely symmetric in all the indices and satisfies 
(10.98). As we will indicate shortly, this form of I, leads to the unique 
(up to a multiplicative constant) fourth order Casimir invariant of any 
simple Lie algebra except for the case of so(8) Lie algebra where there 
are two linearly independent fourth order invariants. 

For higher order Casimir invariants we can do something similar. 
For example, for the sixth order Casimir invariant of the form 


Tes gee Xa XgXyhiXokes (10.100) 


if we require 


(7) g°?°r to be totally symmetric in all the indices, 
(i) gg .5q,5Gor = 0, 
greet peepee =O, 
GPO Gigs Gini 0, (10.101) 


then it follows that Jg, if not trivial, would be independent of lower 
order invariants Iz, [3 and I4. Here gag+, gasys denote respectively the 
completely symmetric third and fourth order tensor structures. We 
point out here that the three conditions in (77) in (10.101) imply (in 
some sense) the absence of respectively terms of the forms (I2)°, Io 14 
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and (J3)? in the definition (10.100) of Ig. This construction can be 
generalized to obtain any higher order symmetric Casimir invariant. 
Without going into technical details, we simply note here that 
the classification of algebraically independent Casimir invariants is 
known in mathematical literature to be as follows: 
1. For Lie algebra A, = su(n+1),n > 1: 


Ig, Is, dagdss es ae one 


2. For Lie algebra B, = so(2n+1),n > 2: 
Io, Ig, Ie, PON son. 


3. For Lie algebra C,, = sp(2n),n > 3: 
Ip, Ig, Ig, eat ) on. 


4. For Lie algebra D,, = so(2n),n > 4: 
Tp, I4, 16,-++ 5 L¢—1) and in (We will explain ia shortly.) 


5. For (exceptional) Lie algebra Go: 
Iz, Ig. 


6. For (exceptional) Lie algebra Fy: 
hy, I, Ig, Tio 


7. For (exceptional) Lie algebra E¢: 
I, Is, Ig, Ig, Ip, Iho. 


8. For (exceptional) Lie algebra E7: 
To, I6, Is, Tio, fiz, fia, Lis. 


9. For (exceptional) Lie algebra Es: 
Tp, Ig, Ti, Lia, is, 120, L2, 130. 


Note that the total number of Casimir invariants for a simple Lie al- 
gebra coincides with the rank of the Lie algera. We did not consider 
here the Lie algebras so(4), so(6) and sp(4) because they are equiva- 
lent to so(3) @ so(3), su(6) and so(5) respectively. Also we note that, 
for a given Lie algebra as described above, I, is unique except for the 
case of Dz, = so(4m) where it has two independent Casimir invari- 
ants at the (2m)th order which we denote as Ig, and Ig. However, 
there exists a well defined individualization of these invariants (see, 
for example, S$. Okubo and J. Patera, J. Math. Phys. 25, 219 (1984)). 
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10.3 Casimir invariants of so(V) 


Let us consider the Casimir invariants of the Lie algebra so(V) de- 
scribed by the generators Jay satisfying (see, for example, (3.26) and 
(3:27)) 


Jad = = hes 
[Jans Jed] = Sacdbd te Ona ae —, Onaedlve anes ObeJad, (10.102) 


where a,b,c,d = 1,2,---,N. This algebra possesses a p-th order 
unsymmetrized Casimir invariant given by 


1 
ie 7 TE las dedaten tthe ta) (10.103) 
for p > 2. However, if N = 2n is even, then, in addition to the 
invariant [,, given by (10.103), there exists an additional n-th order 
Casimir invariant of the form 
~ 1 


Jn = Qn)! ¢21b142b2--anbn Tr (Jaibi Jasbs tee dab) ; (10.104) 


where €@°1""¢nb’n denotes the completely anti-symmetric Levi-Civita 
tensor in the 2n dimensional space. This marks the difference between 
the two cases B,, = so(2n +1) and D, = so(2n). Moreover if p is an 
odd integer, then J, can be reduced to a sum of lower order Casimir 
invariants (of even order). For example, let us consider the case of 
p=3 with 


1 
I3 = 31 Te (Jaya: Jaza3Jaza1) 
(=1)* 
= 3I Tr (Japa; JasazJazas) 5 (10.105) 
where we have used Jay = —Jpq. Furthermore, using the commutation 


relation between the generators as well as the anti-symmetry of the 
generators given in (10.102), we can write 


Jasay Jazaz = Jazaz Jara ae [Jazar» Jazaz] 


a Jazas Jara ie Oajas Jara, a Onde Jasés = Oenes Jaras 
= Jazacdacay + Jaraz + Jasaz => N Jayaz 
= Jazaz Jaga, — (N — 2) Jars: (10.106) 
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Substituting this into (10.105) we obtain 
1 
I3 = 31 Tr ((Jasa2Faza1 — (N — 2) Jara3) Jaras) 


1 (N —2) 
=-7; Tr (Jaza2 Jaca: Jaras) -_ 31 


dr (Jarag Jaras) 


(N — 2) 
3! 
(N — 2) 
3! 


Tr (Jaras (—Jaga1)) 


= —I5+ 


(2!) , (10.107) 


so that we can write 


ie (10.108) 


However, if p is even with p > 4, the invariants J, are not ortho- 
symmetrized. Setting a = (a,b) with a > b and with Jap = Xq, the 
even order Casimir invariant J, can always be written in the form 


Fee alae ae Cle ot. OD. Ceece ey. Coe (10.109) 


with some coefficients h®1°2""°» which are not symmetric in the in- 
dices a1,Q2,:-:Qp, in general. On the other hand, we can always 
reduce the coefficients into a sum of ortho-symmetrized tensors of 
the forms g@102""OR , g%102 g%304"Gp and so on, so that Ip can be re- 
duced to the standard ortho-symmetrized Casimir invariants I). 

Next, let us return to the second order Casimir invariant Jy for 
the general simple Lie algebra. We note that 


Tr (XoXg) = Cop; (10.110) 


where c is a constant. Multiplying (10.110) by g®? and summing over 
repeated indices we obtain 


go? Te Gs) Seg" Gam ed (parm); 
or, d(p)12(p) > cd( Padi): 


d(p) 
or, c= Ia(p). 10.111 
d(Padj) ate) 
Therefore, from (10.110) we recover the relation 
d(p 
Tr (XoXg) = - 2 12(p) Gag 
adj 
d 
= te) In(p) Tr (tag), (10.112) 
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which we have already noted (and used) earlier in (10.80). Here xq 
denotes the generators in the reference representation {/0}. 

As an application of this formula, let us consider the simple case 
of L = so(3) ~ su(2) where, as we have already seen, the angular 
momentum states are labelled by 7 = 0, 53 ae 3, --> and m= —j,-j7+ 
1,---,j-—1,7. The dimensionality of any representation labelled by a 
given 7 is, therefore, obtained from the number of values m can take 
and leads to 


d(p) = d(j) = 25 +1, (10.113) 
while for the adjoint representation we have 
d(pagj) = 2? -1=3, (equivalently, 2x1+1=3), (10.114) 


corresponding to the dimensionality of the algebra or the number of 
generators (for su(n) it is n? — 1). Let the reference representation 
{po} correspond to the fundamental 2-dimensional representation for 
which j = 4 so that d(po) = 2. 

Let us note that the second order Casimir in the reference rep- 
resentation is given by 

Tn(po) = 1Pat). (10.115) 

d(po) 

This follows from (10.112) if we set p = po or it can be derived directly 
from the definition in (10.69) as follows 


9°? Tr (tatg) = 9°? gag = 5% = d(pagj)- (10.116) 


On the other hand, from the definition of the second order Casimir 
invariant (see, for example, (10.71)), we have 


g°? Tr(xaxg) = Tr (2(p0)10) = L2(p0)d(p0), (10.117) 


so that (10.115) follows from (10.116) and (10.117). We note that, in 
the present example (of so(3) ~ su(2)), we have 
U(pagj) _ 3 
Ig(po) = —— 10.118 
(50) = pes — 5 (10.118) 

For the present example of so(3) ~ su(2), we also know (see, for 
example, (3.122)) that the second order Casimir invariant is given up 
to a normalization constant by 


: a ; 1 
Ble) = BG) =IG+), F=05.L--, (10.119) 
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where c’ is a normalization constant. Recalling that for the reference 


1 


representation p9, we have 7 = 5 and I2(po) = 3 we determine the 


normalization constant (from (10.119) by identifying p = po) to be 
cd =2, (10.120) 
which leads to 
In(p) = In(9) = 290 + 1). (10.121) 


The multiplicative factor 2 in (10.116) (compared to (3.122) or the 
conventional definition) can be absorbed into the normalization of the 
Casimir invariant, for example, by defining the second order invariant 
by Ip > 51. If we now consider (10.112) for Xy = Xg = X3 = J3, 
then, for any representation, we have 


j 


Tr (X3.X3) = Tr (J3)? = Som? = a(j +.1)(27 +1) 


7 a jj +1) ee Cade 
_ _d(p) 
aaa J(p) Tr (7373), (10.122) 


which agrees with (10.112). 

Let us next comment on the third order Casimir invariant J3. 
Among simple Lie algebras, L = su(N),N > 3 and so(6) ~ su(4) 
alone have unique third order Casimir invariants. We note here with- 
out going into details that this implies the absence of the triangular 
gauge anomaly for all simple Lie algebras except for these special 
algebras. Absence of gauge anomaly ensures the renormalizability 
(and consistency) of the underlying grand unified field (gauge) the- 
ory based on such a L. 

As we have already noted, the Lie algebras L = su(2), su(3), Go, 
4, Eg, E7 and Eg do not possess the fourth order Casimir invariant. 
Let Xqs denote the generators of such a Lie algebra in the generic 
irreducible representation {p} and let us define 


1 
Gass = So (Tr (XqXgXyX>5) — cTr (XX pg) Tr (X_X5)), 
“Pp 


(10.123) 
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where c is a constant and the summation is over the (4!) possible 
permutations of the indices a, 6,y,6. This, therefore, defines a totally 
symmetric structure and we require (see (10.98)) 


9°? 9” gapys = 0, (10.124) 


which after some calculation determines 


= = (Pad; ) To(Padj) 
BO 5G a(p.4)) A) (0-“Ee?)—— ao.azs 


Since the Lie algebras under consideration do not possess a genuine 
fourth order Casimir invariant, we must have gags in (10.123) vanish 
identically, namely, 


iD (Tr (XoXpXyX5) — Ka(p)Tr (XoXg) Tr (X,X5)) = 0. 
P 
(10.126) 


We can rewrite this relation more compactly as follows. Let €%,a@ = 
1,2,--- ,M be arbitrary constants and let us define 


X =€°Xy, (10.127) 


which would then correspond to an arbitrary element of the Lie al- 
gebra L in the irreducible representation {p}. Multiplying (10.126) 
with €°€%€7€° and summing over all the indices we obtain the trace 
identity 


Tr X4 = Ka(p) (Tr X?)’, (10.128) 


which holds for these Lie algebras. 

As an application, let us continue with the example of L = su(2) 
with X = J3 in the irreducible (27 + 1) dimensional representation 
space with 7 = 0, + 1, 3, +++, as we have already discussed. Then 


using (10.113)-(10.122) we have 


3 4 
Kale) = say LD (s 2G + ») 


3 1 
— 574) (s 0) 
j+1)(23 +1) 


Tr (x2) = Jf : 
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so that 


2 HGF DQI+D , 


Ka(p)(Tr X° 2 


33 +1)-1). (10.129) 


On the other hand, we can evaluate directly (using the table of inte- 
grals) 


RX )=t3)= 3 m* 


m=—J 


_IG+ QI +1), 
15 


which coincides with (10.129) proving (10.128). 

Finally, we note from the classification given earlier that both 
su(2) and Ex do not have any sixth order invariant (in addition to 
the third and the fifth order invariants). In these cases, proceeding 
as before, we can obtain the trace identity 


339 +1)-1), (10.130) 


Tr(X°) = Ke(e) (Ir X) oae)) 
where 
ao 15 d(Padj) : 
Kol?) = along) + 2) (Pag) +4) ( d(o) 
1Io(Pag) , 1 ( L2(Caqj) : 
(Fate a (Se) aos 


We note that d(pag) = 248 for the exceptional Lie algebra Eg and 
relation (10.131) is useful in showing the absence of mixed gauge- 
gravitational anomaly for the heterotic string theory based on Eg x 
Eg. For the case of su(2) we have (see (10.113)-(10.122) as well as 
(10.132)) 


3 1 : 
(x)= Om = (Fig+nei+y) . 


15 a. oe 
Kelp) = (34+2)(3+4) (5) 


Via). fay 4 
a 25G+1) "12 Gey) 
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33 
21(5(7 + 1)(27 + 1))? 
x BGG +1)? -379+1)41), (10.133) 


so that we have 


K¢6(p ) (Trx?)* = aC + 1)(27 +1) 


x (3((G +)? -37G +1) +1). — (10.134) 
On the other hand, a direct evaluation leads to 


Tr X® = Tr (J3)® _> m® 


m=—j 


= 5 AG+ D241) 


x 3G +1)? - 379+) +1), (10.135) 


which coincides with (10.134) proving that (10.131) holds. 
Since the Lie algebra satisfies (see (10.5)) 


Kay Xp, Xs (10.136) 


it follows from this that X, = —XT (where XJ denotes the matrix 
transpose of XQ), also satisfies 


Kei Xgl aie xy. (10.137) 


X, is known as the conjugate or the dual representation {p} of {p}. 
Then, under the transformation Xq — Xq, we have 


i 
I,(p) = d(p) ea ay (Xa, Xaz nia Xap) 
1 20. Gt 
— I,(p) = I greere Tr Xi Xan Kew) 
= (-1)? Ip(0), (10.138) 


since d(p) = d(p). (Note that the structures g®!°2""°? involve the 
fixed reference representation and, therefore, are not affected by this 
transformation.) Therefore, for even p = 2m we have Igm(p) = 
Inm(p). Since for simple Lie algebras su(2), so(4m), so(2m+1), sp(2m), 
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G2, F4, E7 and Eg do not have any odd order Casimir invariant, we see 
that all the Casimir invariants of these algebras are invariant under 
Xo 7 ben This suggests that, for these algebras, the two irreducible 
representations {p} and {p} are equivalent, namely, there exists a 
nonsingular matrix S (similarity transformation) which relates the 
generators in the two representations as 


Nee SSNs: (10.139) 
Taking the transpose of both sides of (10.139) we obtain 
ES SUES) SS SoS ys 
or, [Xg,S7S7'] =0, 
or, S?S-'=)1, or, S7=AS, (10.140) 


for some constant ». Taking the transpose of the last relation leads 
to 


S=A8S? =S5, or, A=+1, (10.141) 


so that the similarity transformation satisfies 


ST =+8. (10.142) 


Representations satisfying (10.142) are known respectively as real and 
pseudo-real representations depending on the two respective signs. It 
is known in mathematical literature that any irreducible represen- 
tation of these Lie algebras without odd order Casimir invariants 
(mentioned above) is indeed self-conjugate, namely, {p} and {p} are 
equivalent. As a result, all of their irreducible representations are 
either real or pseudo-real. On the other hand, for simple Lie algebras 
su(n),n > 3, so(4n +2) and Eg¢ which possess at least one odd order 
Casimir invariant, some of their representations are complex, namely, 
the representations of these algebras are neither real nor pseudo-real. 
For the case of su(2), it is known that any tensor representation is 
real, but spinor representations are pseudo-real. 

The adjoint representation of a simple Lie algebra is always self- 
conjugate and real, namely, it satisfies 


—(adXy)? =g adX,)9, g' =9, (10.143) 


where g and g~' are M x M matrices with M = d(aqj) and are 
defined by the metric as 


(9)o8 =9o8: (9 *)ap = 97°. (10.144) 
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This can be seen as follows. Let us note that (see (3.112)-(3.115)) 


((ad Xa)9) By = (ad Xa) 8595+ -_ Co e954 = fapy; (10.145) 


which is completely anti-symmetric in all the indices. We can now 
take the transpose of (10.145) and using the anti-symmetry of the 
structure constant as well as the symmetry of the metric, we obtain 


((ad Xq)g)* = —(ad Xq)g, 
Or, glad Xa)" = —(ad Xo) Os 
or, —(adX,)? =g¢ (ad X,)g; (10.146) 


which proves (10.143). 


In ending this discussion we will give some examples of J,(p) 
for the su(n) Lie algebra. Let X‘, denote the generators in the Lie 
algebra u(NV) and define 


1 
AN, = XH x*. (10.147) 


Clearly, these are traceless generators and belong to su(n). Us- 
ing these we can obtain the unsymmetrized the p th order Casimir 
invariant of su(N) from those of u(N), symmetrize and orthonor- 
malize them suitably as follows. If p = (fi, fo, fz,---, fn) with 
fi > fo > --: > fm represent the Young tableaux corresponding 
to the irreducible representation of su(n), it is more convenient fo 
our purposes to introduce 


1 ea te 
o9=fitgN+)-j-_dife=4- Fd fe (10.148) 
k=1 k=1 
for 7 = 1,2,--- ,N and they satisfy 0, > 02 >--- >on as well as 


N 
gee: (10.149) 


We note that oj defined in (10.148) is invariant under the shift f; > 
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fj +A for any arbitrary A. We can now easily calculate 


N N? 
In(p) = NY 0(0;)? - oo (Gy? 1); 
j=l 
N 
Is(p) = N? Y0(0;)°, 
j=l 
N N 2 
Is(p) = N(N? +1) 9 (0;)* — (2N? — 8) | $5 (05)? 
j=l j= 
. = N?(N? — 1)(N? — 4)(N? — 9) 
N 
Is(p) = N?(N? +5) S_ (03)? 
j=l 
N N 
— 5N(N? — 2) S (03) (32160 ; 
j=l k=1 
N 
Is(p) = N(N* + 15N? +8) ¥ (a; 
j=1 
N N 
— 6(N? — 4)(N? +5) | $0(0;)? (32100 
j=l k=1 
N 2 
+ 7N( 7) Sin" 
j=l 
N 2 
(3N* — 11N? + 80) | $0 (0;)° 
j=l 
N?(N2 — 1)(N? — 4)(N? — 9)(N? — 16)(N? — 25) 
30240 


(10.150) 


and so on. Here we have changed the normalization of I,(p) by 
multiplying suitable constants to make it simpler. Also the formula 
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for the dimension is now given by (see (10.55)) 
1 N 


d(p) = (Day (ND) [] (i - on). (10.151) 


j<k 


We note here that since the su(3) Lie algebra, or example, does not 
possess the 4th, 5th and the 6th order Casimir invariants, we must 
have 


T4(p) = I5(p) = Ie(p) = 0, (10.152) 


identically for N = 3. We can verify these directly as follows. For 
example, J4 = 0 for N = 3 we obtain from (10.150) 


2 


3 3 
30S (o;)*- 15 | So(o;)? | =0, 
j=l j=l 
1 
or, a +h+4+= 5 (a oy ge oa ae (10.153) 


fora+b+c=0. Here we have identified a = 01,b = 02,c = 03. To 
verify the other cases, let us, for example, choose a totally symmetric 
tensor representation (f1, f2,--- , fy) = (f,0,0,--- ,0) where we have 
identified f, = f. In this case, it follows from (10.148) that 


Gi) ; N+1- 2) 


10.154 
N 5 ( ) 


OF = 


Let us denote this particular representation as Ay and note from 
(10.150) that it leads to 


In(Ay) =(N-DFN+P), 


Is(Ay) = 5 (N — 1)(N ~2)F(V + AUN +28), 
Ta(Ay) = 5 (N - 1)(N - 2)(N - 3) FV + f) 
x (N(N~1) + 6f(N +), 
Is(Ay) = 57 (N ~1)(N —2)(N —8)(N — 4) FOV + AYO +28) 


x (N(N —5)+12f(N+f)), 
Ig(Ap) = (N —1)(N — 2)(N — 3)(N —4)(N — 5)f(N + f) 
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| N(N - Oe 15N — =, (10.155) 


and so on. It is clear from (10.155) that I4(Ay) = 0 for N = 2,3; 
I5(Ay) = 0 for N = 2,3, 4; Ig(Ap) = 0 for N = 2,3,4,5 in accordance 
with the classification given earlier. 

For the totally anti-symmetric representation 1/ with the Young 
tableaux 

17 = (1,1,1,--- ,1,0,0,--- ,0), 
——S 
f 

the corresponding p th order Casimir invariants are formally obtained 


by replacing N —> —N in (10.155) for the case of I,(Ay) so that, for 
example, we find 


Bae 5 (N +1)(N +2) F(N — f)(N — 2f). (10.156) 


Remarkably, the same replacement (NV + —JN) also holds for the for- 
mula of the dimension (10.151) except for the additional multiplica- 
tive factor (—1)/. For example, we note from (10.151) that for the 
symmetric representation with f = 2, we have d(A2) = 5 N(N + 1) 
while (with the replacement) for the anti-symmetric representation 
we have d(1?) = cy N(N —1) =4N(N — 1). (See P. Cvitanovic 
and A. D. Kennedy, Phys. Scripta 26, 5 (1982).) 


10.4 Generalized Dynkin indices 


Let {p} be a representation of a simple Lie algebra which is not 
necessarily irreducible. For the symmetric orthonormalized p th or- 
der Casimir invariant J, (in this section only we denote the Casimir 
invariants by J simply because the example of the standard model in- 
volves isospin generators which are labelled conventionally by [;,7 = 
1, 2,3) defined as in (10.101) (and discussion around there), we define 
the p th order Dynkin index for the representation {p} by 


D,(p) = Tr Jp. (10.157) 


If {p} is irreducible, then as we have already seen J, = Jp(p)1 so 
that we have 


Dy(p) = Jp(p)d(p), (10.158) 
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for any irreducible representation {po}. On the other hand, for a 
general representation, we decompose {p} as a direct sum of its irre- 
ducible components as 


{o} = >_ @{pi}, (10.159) 
1=1 


corresponding to the decomposition (.S denotes a similarity transfor- 
mation) 


x 


boy, 
X,=S'X,S = (10.160) 


< 


for irreducible matrices xO 4 = 1,2,---,n. Taking the p th power 
of this relation, tracing over the matrix indices and multiplying with 
the structure g®!°2""% we obtain the Dynkin index for the general 
representation to be (Jp(p;) = ger op XO XO) ee @) 


Jp(p1) 11) 
Jp(p2) 1 (2) 
D,(p) = Tr 
= S— Jp(p:)d(oi) = S> Dp(pi); (10.161) 
i=1 i=l 


where we have used (10.158). Together with the standard relation 
d(p) = > d(pi), (10.162) 
i=l 


these relations are often sufficient to determine the decomposition of a 
reducible representation {p} into a sum of its irreducible components. 

As we have noted in the previous section, any simple Lie alge- 
bra which does not possess the third order Casimir invariant has no 
triangle anomaly. For an irreducible representation {p}, therefore, 
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this corresponds to D3(p) = 0. However, if the third order Casimir 
invariant does not vanish for a simple Lie algebra, then if {p} is re- 
ducible, then the condition for the absence of the triangle anomaly is 
given by 


D3(p) = )> D3(pi) = 0. (10.163) 
i=1 


The well known example is the grand unified theory based on su(5) 
which does possess a third order Casimir invariant. However, it is 
known that for a representation ({p}) consisting of a sum of the 
5 and 10 dimensional irreducible representations of su(5), we have 
Ds3(p) = 0 and there is no triangle anomaly in such a theory. As 
we will show in the next chapter, this is related to the branching 
rule for the decomposition of so(10) into su(5). We note that so(10) 
does not have a third order Casimir invariant and, therefore, does 
not have the triangle anomaly (D3(p) = 0). The 16 dimensional 
spinor representation of so(10) decomposes into a direct sum of the 
5 and 10 dimensional irreducible representations of su(5) and the 
branching sum rule immediately gives D3({5}) + D3({10}) = 0. (We 
note that there is also a trivial one dimensional representation in the 
decomposition, but D3({1}) = 0.) 

As a slight digression, let us note that so far we have only dis- 
cussed about simple Lie algebras. However, sometimes it is of interest 
to consider non-simple algebras in physical theories. A familiar ex- 
ample of such a theory is the standard electro-weak theory based on 
the SU(2) ® U(1) gauge group. In this case, we do have a third 
order Casimir invariant given by J3 = Jy ® J, where J; and Jo 
are respectively the first and the second order Casimir invariants of 
L = su(2) @ u(1). In a given irreducible representation {p;}, there- 
fore, this would imply D3(p;:) = J3(pi)d(pi) = Jo(pi)Ji(pi)d(pi). As 
a result, the absence of triangle anomaly in a general reducible rep- 
resentation in this theory is determined from 


Da(p) = >) Ds(pi) = 95 Ja(pi) Au(9) (vi) 
i=1 


i=1 


n 
=S° (i+ Y¥i(2i; +1) =0, (10.164) 
i=1 


where I;, Y; denote the isospin and hypercharge quantum numbers 


of the states in {p;}. (Here we have identified J; = 4) It is well 
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known that in the electro-weak theory such a condition holds because 
of cancellation of the contribution due to quarks and leptons. 

With this small digression, let us return to our discussion of 
simple Lie algebras. One interesting example is the Clebsch-Gordan 
decomposition of a product representation {p,4} @ {pg} of two irre- 
ducible representations {94} and {pg} of L into its irreducible com- 
ponents, 


{pa} ® {pp} = >_ Of pi}. (10.165) 
t=1 


We will show now that in this case, 


Dp(pa ® ps) = d(pp)Dp(pa) + U(pa)Dp(pp), (10.166) 


so that Dynkin’s index formula leads to the sum rule 


n 


Dy(pa ® ps) = S| D(pi), 
1=1 


or, d(pp)Dp(pa) + d(pa)Dp(pz) = >_ d(p:) Jp(pi), 
i=1 


or, d(pa)d(pp) (Jp(pa) + Jp(or)) = >_ d(pi) Jp(pi), (10.167) 
i=1 


since 94, PB, /p; are all irreducible by assumption. 
To prove (10.166) we note that for a tensor product representa- 
tion of two irreducible representation, we can write 


Xy=XA4 @1gp4+14,@ XxX), (10.168) 


so that it follows 


M 
Dp(pa ® ps) = Ss" gn ee (Xa, Xaz i X ey) 


Q1,02,°'Ap=1 


M 
a DCm! ae — x!) Tr (1p feat 1p) + (A oS B) 


1,02, Ap=l 
+R 
= d(pp)Dp(pa) + d(pa)Dp(pp) + R, (10.169) 
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where R contains all the mixed terms involving the products of Xqs 
in the two representations. To show that R = 0, let us recall that, 
for simple Lie algebras, Tr X, = 0. However, R may contain traces 
of quadratic or higher order products of the generators. For example, 
suppose R contains a term of the form 


Geer Te AY) THX) 


B 
a1 a2 a3 ies oe x), (10.170) 
then we can write (the derivation is a bit involved so that we only 


give the result here) 


Tr (XQ) XQ”) = Ca grass 


Tr (x st xB )=CpB Gor3-Op 1 CpGseaGaaie shes; CUO.1F1) 


where C4, Cg, C'z,-:: are (representation dependent) constants. On 
the other hand, orthogonality conditions such as (see, for example, 
(10.101)) 


Gone Jory a290304:Ap = 0, (10.172) 


etc makes such a term vanish identically. In a similar manner, it 
can be shown that all the other kinds of terms in FR involving higher 
powers of Xq also vanish so that R = 0 and we have the sum rule 
(10.166). It is worth emphasizing here that the orthogonality rela- 
tions are crucial for the validity of this sum rule. 

Another use of the Dynkin indices is within the context of the 
branching rule. Let Lo be a simple sub-Lie algebra of a simple Lie 
algebra L (for example, L = Eg and Lo = Ds = so(10)). Then, an 
irreducible representation {po} of LZ, in general, becomes a reducible 
representation of Lo so that we may decompose {p} as a direct sum 


{o} = Sof}, (10.173) 


of the irreducible representations {pO of the subalgebra Lo. For 
this case we can write 


&>Dp(p) = S~ Dp(p$””), (10.174) 


where &, is a constant which does not depend on the representation 
{pe}. To prove this relation we note that, for generators Xq of the Lie 
algebra L we can write 


1 
BI da Tt (XasXa2“-- Xap) 
“ ?P 


= CyDp(0)Gox---0p + CpGora2Ias--ap t*** (10.175) 
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where the sum over P denotes sum over the p! permutations of the in- 
dices a1, 2,+++ ,Qp. We recall that g™°""°P X, --- Xa, = Jp where 
Jp is the p-th order symmetrized (but not orthogonalized) Casimir 
invariant. As a result, (10.175) basically expresses the fact that the 
left hand side can always be decomposed into a sum of gqy...8 satis- 
fying relations as in (10.101). Let X; with j = 1,2,--- ,Mo(< M) 
denote the generators of Lo (here dim Lg = Mo). Then restricting the 
generators to this subalgebra, we can take the trace, multiply with 
the pth order structure gj'””"’” of Lo and sum over all the indices to 
obtain 


Gr PTX Kya Xs) 
= CpDp(p) 99° 7? G51---5p + Cp90”? GirieGiedp + °°» (10.176) 


where we have used (10.175). Furthermore, we can express 


0 0 
Ijjo = Bog, Ijrjois = Bags a and so on, (10.177) 


where (1, 62,--- are constants. In view of the orthogonality condi- 
tions (see, for example, (10.172)) 
sie oa ds Tae es 
Ge atte =O: (10.178) 


and so on, we obtain (10.174) 
S~ Doles”) = & Dp), (10.179) 


with 
(eed OF aaa eee (10.180) 


Note that €, could be identically zero, since Lo may not possess the 
genuine pth order Casimir invariant. The value of €, can be deter- 
mined in general as follows. For the simplest irreducible representa- 
tion {p} of L, the decomposition {p} = >>, @f{pO} is relatively easy 
to determine and one can calculate €, in this case from the sum rule. 
However, since €, is independent of the representation {p}, this value 
can then be used for any other irreducible representation of D in the 
formula for the branching rule (10.174) (or (10.179)). (For some other 
types of indices which satisfy analogous sum rules for such decompo- 
sitions, see S. Okubo and J. Patera, J. Math. Phys. 24, 2922 (1983); 
J. Math. Phys. 25, 219 (1984). Other indices are also useful in the 
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study of anomaly free gauge field theories in the higher dimensional 
space-time, see, for example, Y. Tosa and S$. Okubo, Phys. Rev. D36, 
2484 (1984); Phys. Rev. D37, 996 (1988); Phys. Rev. D40, 1925 
(1989).) 

We have already noted the validity of the 4th order trace identity 
(10.128) when the simple Lie algebra L does not possess the genuine 
4th order Casimir invariant. We note, however, that for any simple 
Lie algebra we have a more general relation given by 


Tr X4 — K4(p) (Tr X?)” = fu(X)Da(p), (10.181) 
where X is defined in (10.127) and f4(X) denotes a scalar function of 


X obtained as follows. We recall that the pth order Casimir invariant 
is defined as 


J = gree ee Xi ee Pore Xap: (10.182) 
In terms of these we can write 
1 a 
fr(€) = D(p0) gee ne Cay Gas a a= fp(X)), (10.183) 


where {po} denotes the reference representation. Similarly, we have 
Tr (X°) = f3(X) D3(p). (10.184) 


Let us conclude by noting that if X denotes the irreducible 
matrix representation of the generator X in the representation {p}. 
Then fp(X) = fp(X) in view of the same value of Jps. This leads to 
the relation 


Tr X4 — Ka(p) (1X2) Dal) 


Tr X4 — K4(p) (Tr_X2)?_ Da(p)’ 


and 


TrX®  Ds(p) Tr X?___ Da(f) 


= = 10.1 
TeX? D3(p) rx Data)’ ee 


and so on. These relations are valid for any irreducible matrix repre- 
sentations of X and X for any simple Lie algebra. 
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CHAPTER 11 


Root system of Lie algebras 


We have derived the irreducible space of the Lie algebra su(N) from 
the unitary group U(N) by the method of Weyl. However, it may be 
desirable to obtain the same directly from the Lie algebra structure 
itself without utilizing the global group structure. This is the goal 
of the present chapter. The method to be used, for this purpose, 
is essentially special cases of the more general Cartan-Dynkin theory 
which is discussed in great detail in the literature and which we review 
briefly in the next section. The present discussion will allow us to see 
its connection with the Young tableaux method discussed earlier. 


11.1 Cartan-Dynkin theory 


In this section, we will give a brief description of some of the im- 
portant results of the Cartan-Dynkin theory without proof which is 
beyond the scope of the present book. For a comprehensive account 
of the results, we refer the reader to the references at the end of this 
chapter. 

Let G denote a finite dimensional simple Lie group with LD rep- 
resenting its Lie algebra defined over the complex field. Let tg,a = 
1,2,--- ,dim JZ correspond to the basis elements of L where dim L 
denotes the dimension of EL. For example, the dimension of the Lie 
algebra su(V) is given by 


dim su(N) = N?—1. (11.1) 


As we have discussed the elements of the algebra can always be cho- 
sen such that they define an orthonormal basis so that we can always 
diagonalize the metric with suitable normalization of t, (see, for ex- 
ample, (8.71)) in the form 


Jab = Tradt, adty = Coqgp. (11.2) 
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However, even though this is the natural basis for the Lie algebra, var- 
ious algebraic questions such as the representation theory are much 
better studied in an alternate basis known as the Cartan-Weyl ba- 
sis. Here the generators are divided into two distinct classes in the 
following way. 


1. Cartan subalgebra: There exist ¢ maximally commuting ele- 
ments of L denoted by Hy, H2,--- , He such that 


[Hi,H;]=0, i,j =1,2,---,2 (11.3) 


The integer @ is known as the rank of the Lie algebra, namely, 
rank L = é. For example, for L = su(N), we have 


rank su(N) = N—-1. (11.4) 


It is clear that the generators of the Cartan subalgebra are not 
unique. In fact, given a set of generators {H;} satisfying (11.3), 
we can always define another set given by 


e 
Hi = Sagi; aj; = constants, (11.5) 
j=l 


which would also satisfy (11.3). Therefore, we can, in general, 
choose a set of generators which would satisfy 


Gij = Co Trad Had Hj = Cb;;, 4,9 =1,2,--- ee (11.6) 
However, we will not use this special form for a while. 


2. The complement: We will assume that the generators of the 
Cartan subalgebra are Hermitian, namely, 

H' = H; (11.7) 
partly because it facilitates our discussion, but mainly because 
Hermiticity is quite important in physics. In fact, the Hermitic- 
ity follows essentially as a consequence of the Cartan-Dynkin 
theory and has been utilized by Weyl to prove the full reducibil- 
ity of any representation of a simple Lie algebra L. (This is the 
celebrated unitarity trick of Weyl. The full reducibility can now 
also be proved using cohomology theory without using Weyl’s 
unitarity trick. But these topics are beyond the scope of this 
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book.) We note that in the adjoint representation, we can also 
write (11.3) and (11.7) as 


[ad H;, ad H;] =ad [Batts = 0, (ad H;)" = ad Hj, 
(11.8) 


Since there are € such commuting Hermitian generators, we 
can diagonalize them simultaneously with real eigenvalues a,, 
namely (in the adjoint representation), 


(adH,)X =a;X, aks =a;, i=1,2,---,6 (11.9) 


for some X € L. The zero eigenvalue solutions with a, = 
a2 = --: = ag = O for this equation always exist for X = 
Hj,j = 1,2,---,@ and they naturally define a ¢-dimensional 
vector space Vo. Let e!,e?--- ,e’ denote a set of complete basis 
vectors in this space. Then, any nontrivial vector a # 0 in Vo 
can be written in this basis as 


é 
a= > aye’ = aye! + age? +--+ are’, (11.10) 


i=1 


where a; # 0 denote the components of the vector @ in this 
basis. 


With the help of these vectors, one can now try to organize 
the rest of the generators of L by taking complex combinations 
of them, much like the raising and lowering operators (step 
operators), Ji, of angular monemtum (see, for example, (3.59)). 
In this way, we can find generators Eq for a vector a # 0 to 
satisfy 


[Hi, Ho] = aika, 
ots (ad He =O.» Ca 12. eta 4d, (11.11) 


so that a;’s represent the eigenvalues of H; in the adjoint repre- 
sentation. We note here for future use that (11.11) is invariant 
under Eq + CaEa where Cas are arbitrary constants. 


3. Root vector: We can think of the components a; which satisfy 
the commutation relations (11.11) to correspond to structure 
constants in the Cartan-Weyl basis. A specific nontrivial vec- 
tor a with components given by these structure constants (a; 
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satisfying (11.11)) is known as a root vector and since there are 
(dim L—rank L) number of generators in the complement, there 
will be as many nontrivial root vectors in this basis. (In this 
language, then, the Cartan subalgebra can be thought of as the 
algebra of the generators with vanishing roots.) For example, 
we note that the number of root vectors in su(NV) is given by 


dim su(N) — rank su(N) = (N? — 1) — (N—- 1) 
= N(N —1). (11.12) 


Note that this is an even integer. 


. Properties of roots: It is known that we can classify all the 


nontrivial roots of a Lie algebra into two groups. If the compo- 
nents of a root given by 


L 
a= S ae’, (11.13) 
i=1 


satisfy a; > 0,2 = 1,2,--- ,2, then it is called a positive root 
(the generator F,, in such a case will correspond to raising (step 
up) operators) while if all the components are negative, a; < 
0,2 = 1,2,--- ,@, it is called a negative root (the corresponding 
generator E, would represent a lowering (step down) operator). 
It is also known that a;s are all integers. 


Since the generators H; are Hermitian and the components a; 
are real, by taking the Hermitian conjugate of (11.11), we ob- 
tain 


[H;, E'] = —a;Et. (11.14) 


This shows that if @ is a root vector, then (—q@) is also a root 
vector and that we can identify 


EL=E., (Hj, Bx] =—ajF-e, (11.15) 


and the even number of generators in the complement (see, 
for example, (11.12)) can be arranged into pairs of raising and 
lowering (step) operators. (In addition, one can show that for 
a simple Lie algebra, the roots are not degenerate so that we 
cannot have two generators in the complement, F,, E’,, corre- 
sponding to the same root.) 
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It is also worth pointing out here that the dependence of the 
generators on the roots is not linear so that the sum of two 
roots is not necessarily a root, namely, 


Eq + Ep # Eats: Eko # kEo, (11.16) 


where 
a= S aie’, B= Me bie’, Clk 7) 


denote two roots and & is an integer. 


5. Commutation relations: We note from the Jacobi identity 
that the generators of L must satisfy 


[Hi, (Ea, L]) = [[Hi, Fo], Ba] + [Ea, [Hi, Ep] 


= (a; + b;)[Ea, Eg], (11.18) 
where @ and @ are two roots with components a; and };. This 
implies that if @ = —a, we have 

[Fis oe Fey] == Os (11.19) 


On the other hand, since H;s denote the maximal number of 
commuting elements in L, (11.19) implies that we can write 


bp oepe ee = Som = Hy, (11.20) 


for some constants a’. 


To determine the constants a’ and a;, let us introduce the metric 
tensor 


Vij = Co Tr (ad H; ad H;), (11.21) 


where Co is a normalization constant usually chosen to be unity. 
We can now calculate 


Tr (ad H; [ad Eq, ad E_q]) = Tr (adH; ad |Ea, E_a]) 


e e 
= Tr (ad H; ad()~ a! H;)) = Se a! Tr (ad H; ad H;) 
j=l j=l 
ie 
ar Sa! giz. (11.22) 
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Here we have used (11.20) and (11.21) in the intermediate steps. 
On the other hand, using the cyclicity of the trace we also have 


Tr (ad H; [ad E., ad E_,|) = Tr ([ad Hi, ad Eq] ad E_q) 
= Tr (ad [H;, E,| ad E_.) = a; Tr (ad Ey ad E_q), (11.23) 


so that from (11.22) and (11.23), we determine 
f . 
So gi" = Coa; Tr (ad E,, ad E_q). (11.24) 
j=l 


We note that the normalization of the generators Ey is ar- 
bitrary since the defining relation (11.11) is invariant under 
Eq — CaF. Therefore, we can choose the constants Cg such 
that 

Co Tr (ad E, ad E_.) = 1, (11.25) 


and, in this case, (11.24) leads to 
L £L 
a = Se 9507, or, a= So gay, (11.26) 
j=l j=l 
where g’! is the inverse of the metric Gij, namely, 


L 
k=1 


Returning to (11.18), we note that if a+ 4 0, then, in general 
we can write 


[Ea, Eg] = N(05, 8) Bai ps (11.28) 
for some constant N(a,). However, we also recognize that 
a+ may not be a root in general. In this case, either Ey 3 = 0 


or N(a, 8) = 0. 


Therefore, collecting all these results (together with (11.3) and 
(11.11)), we can write the complete algebra in the Cartan-Weyl 
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basis in the form 


By, A, =0 = [Aas tel; 

Hj, Eq] = aiEa, 
aA; =H, fat p=, 

Eo, Es] = N(a,8)Eaxg, ifa+Bisaroot, (11.29) 
0 otherwise. 


The constant N(a,() can be chosen to be real and can be 
calculated explicitly if we know all the roots of the algebra. 
However, even without the explicit form, it follows from the 
algebra that they satisfy various relations such as 


N(a, B) = —N(8,a) = —N(—a,—8) = N(—a,a + 8) 
= —N(—£,a+ 8) = —N(-a — £, B), (11.30) 


which are known as Weyl’s basis relations. 


Let @ and @ be two nonzero roots with 


L L 
a= Aye , B = bie ) 
i=l i=1 


and introduce an inner product by the relation 


£L 
(a, 8) = Tr (ad Ha ad Hg) = > g'ajby. (11.31) 
ig=l 


We then note from (11.29) that we can write 


[Ha, Eg] = Se H;, Eg] = Nowe 


= (a, B) Eg. (11.32) 
If we now identify 


(a)_ @)_ f_2 op a 
Ie = Ty Her Is ta, (11.33) 


(a, a) 
then, for any given a, they satisfy the su(2) algebra 


(IS, HO = I), (I), IO) = 24s, (11.34) 
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and the eigenvalues of Je) must have the form ini) for integer 
n©. Therefore, we conclude that the eigenvalues of Hy must 
have the form 


1 

5 (@) ni), (11.35) 
with n™ an integer. 

Let m; be an eigenvalue of H; (see (11.55)), namely, 


This leads to 
Had = (S\a'H;)b = (D> aimi)¢. (11.37) 
i=1 i=1 
If we now define a vector (see also (11.58)) 
é . 
M = S— me’, (11.38) 
i=1 


then it follows from (11.31) that 


£ £ 
(M, a) = Ss" gi mia; = So mai. (11.39) 
ij=l i=1 


It follows now from (11.35) and (11.37) that 


= integer. (11.40) 


Similarly, let us consider the adjoint representation which gives 
(ad Ha) Eg = [Ha, Eg] = (a, 8) Es, (11.41) 


so that ad Hy, (namely, Hq in the adjoint representation) has 
an eigenvalue (a, 3). Comparing with (11.35) we see that 


——— = integer, (11.42) 


for any two nontrivial roots @ and f. 
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6. Simple roots: It is clear that the structure of the Lie algebra 
is completely contained in its structure constants and, as we 
have noted, these are nothing other than the root vectors of 
the Lie algebra in the Cartan-Weyl basis. Furthermore, we 
have seen that the negative root vectors are merely negatives of 
the positive root vectors. Therefore, the positive root vectors 
really encode the structure of the Lie algebra. 


Let us define a simple root (also known as a fundamental root) 
as a positive root vector which cannot be written as a sum of two 
positive root vectors. With a little bit of work it can be shown 
that the number of simple roots equal rank L and they are all 
linearly independent. It is clear from this description that we 
can always identify the simple (fundamental) roots with 


a,=e', i=1,2,---,rankL. (11.43) 


Furthermore, as we have already discussed, any positive root 
can be written as a linear superposition of fundamental roots 
with positive coefficients. Namely, if a;,2 = 1,2,--- ,rank LD 
denotes a simple root system, then any positive root vector can 
be expressed as 


a= ajeu, (11.44) 
i=1 


where a; denote nonnegative integers. Therefore, the structure 
of the Lie algebra now resides completely in the simple root 
system of an algebra. 


7. Cartan matrix: Let us recall from (11.31) that, for any two 
nonzero roots @ and 3, we can write 


£ 
(a, 8) = Tr (ad Hoad Hg) = S~ a'b’'Tr (ad Hj ad Hj) 
ij=l 


U 
= So ab gis, (11.45) 
ij=l 
where we have used (11.6) (with Cp = 1). On the other hand, 


using the expansion in (11.44) we can express the left hand side 
of (11.45) as 


£ 


(a, 8) = S> ab! (ai, a5), (11.46) 


i,j=l 
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so that comparing (11.45) and (11.46) we determine 

(aj, a5) = gij- (11.47) 
With this, we can now define the Cartan matrix as (7,7 are 
fixed indices and j is not summed) 
2(aj, Oj) 
(aj, a5) 
which is an integer because of (11.42). It is clear from (11.48) 


that all the diagonal elements of this matrix are 2, namely, (7 
not summed) 


Ky =2, i=1,2,---,2 (11.49) 


On the other hand, the off-diagonal elements are known to take 
only four possible values, namely, for i 4 7 


Ki = 0,—1,—2, 3. (11.50) 


Similarly, we can introduce an angle between two fundamental 
roots Q;,Q@; as 


Seas. (11.51) 
V (ai, 4) (047, 45) 
and, for 1 4 7, they assume only four values 
6;; = 90°, 120°, 135°, 150°. (11.52) 
Note that we have Kj; ~ K;;, in general, with 
Ki Kj: = 4(cos 6:3). (11.53) 


The two sets of values in (11.50) and (11.52) can be shown 
to be in one to one correspondence. Diagrammatically, this 
is represented (for i # 7) as in Fig. 11.1 where each vertex 
represents a root vector and the number of bonds connecting 
two vertices corresponds to 


2(ax;, Oj) 
V (aj, Oj) (Ci, 4) 


For example, we note from (11.50) and (11.53) that when 6;; = 
90°, we have Ky; = Kj; = 0, while for 6;; = 120°, we have 
Ki; = Kj; = —1 Similarly, when 6;; = 135°, we can have 
Ky; = 2Kj, = —2 or Kj, = 2Ki; = —2, while for 6;; = 150° we 
have Kij = 3K jj =-—3 or Kyi = 3Ki; = —3. 


nig = gi = [Ky Kyi] = (11.54) 
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O ©) : 6; =90°, 


ay a; 
==) 64; = 120°, 
a; aj 
@=——6 ys 
Q; aj 
6 aed 
ay a; 


Figure 11.1: Diagrammatic representation of the off-diagonal ele- 
ments of the Cartan matrix. 


8. Dynkin diagrams: Dynkin diagrams are a diagrammatic way 
of representing the Cartan matrix and are built out of the el- 
ements known as nodes (basis vectors) and bonds representing 
the magnitude of the off-diagonal element of the Cartan matrix. 
We can always choose the normalization constant C’ in (11.6) 
such that the minimum of the norm among all of the @ vectors, 
namely, (@;,a;) for 7 = 1,2,---,@ is unity. In that case, it 
is known that the values that any of the norms can have are 
1,2,3. In some literature, these are represented diagrammati- 
cally as open circles with 1,2 or 3 on top denoting their norms. 
However, we will follow the slightly different convention (which 
is standard by now), namely, each node represented by an open 
circle denotes a vector a; with norm unity while a filled (black) 
circle represents a vector a@; with norm 2 or 3 (with some fixed 
normalization). The number of nodes corresponds to the rank 
of the algebra (and, therefore, gives the dimensionality of the 
Cartan matrix) and the number of bonds between two nodes 
represent the magnitude of the off-diagonal elements in the ma- 
trix (remember that the nontrivial off-diagonal elements are all 
negative). We will explain this in more detail within context 
when we discuss different algebras later. 


9. Weight vector: As we have mentioned before, since [H;, Hj] = 
0, we can simultaneously diagonalize these ¢ generators in any 
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irreducible representation V and we emphasize that V 4 Vo 
except in the adjoint representation. Therefore, in any repre- 
sentation we can write 


where ¢ € V is the simultaneous eiegenvector and the eigenval- 
ues ™m1,™2,--- ,mg are known as weights. If we wish, we may 
also represent the eigenstate in the Dirac notation as 


b = |p,™m1,mMg,-++ ,me), (11.56) 


in analogy with the angular momentum states |j,m) of su(2). 
Here p labels the irreducible representation space V (like 7 in 
angular momentum). Moreover, using the inverse of the metric 
gij (see (11.6)), to write 


m= Sg my, (11.57) 
which allows us to introduce a vector in this space given by 
oe 
M= Soma = So gmiay. (11.58) 


i=1 ij=l 


This is known as a weight vector. Note that since the vectors 
a; = e; define a basis in Vo, the weight vector is a vector in 
the root space, but, in general, may not be a root vector (e.g. 
the weight vector M can be identically zero etc). This suggests 
that we may label the simultaneous eigenvector simply as @jy 
or |M) so that we can rewrite (11.55) as 


H;|M) = m;|M). (11.59) 


Together with (11.57), this also suggests that 
eS) oi (11.60) 
satisfies the eigenvalue equation 
H'|M) = ) 0g! Hj|M) = 79" m;|M) 
j=l j=l 


=m'|M). (11.61) 
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Suppose now that |N) denotes another weight vector with com- 


onents n!,n?,--- ,n®, namel 
: ? ? ? p) 


H'|N) =n'|N). (11.62) 


In this case, if at least one of the components of M is larger 
than the corresponding components of N while the other com- 
ponents being equal, we say that the weight vector M is higher 
than N and denote this by M > N. For example, if m! > n! 
while m? = n’,i = 2,3,--- ,0, we say M > N. Similarly, if 
m? > n? while m' = n',i ¥ 2, we still say M > N. This is con- 
nected with the fact that the choice of the Cartan generators 
is nonunique as we have pointed out in (11.5) and using this 
one can always reorder the generators. In a finite dimensional 
irreducible representation, there always exists a weight vector 
A= ea a; such that any other weight M is lower than A, 
namely, A > M. In this case, one says that A defines the high- 
est weight vector determined completely by the highest weight 
state |A) = ¢,. This, of course, presupposes that the highest 
weight state is unique. For any finite dimensional irreducible 
representation, this can, in fact, be shown (as we will discuss 
in the next section). 


The importance of the highest weight state |A) = , comes 
from the fact that for any positive root @ we have 


Egg, =0, a>0. (11.63) 


This can be seen as follows. Using (11.29) and (11.55) we have 


[Hi, Exalém = (HiEta — Eta)om = aid, 
or, Hi(E+a¢m) = (Mm + 4i)(Etadm), (11.64) 


so that (Fi.¢yz) is also a simultaneous eigenstate with eigen- 
values (m; + a;). Correspondingly, we have 


H' (Bab) = (m' +a')(Eta¢bm). (11.65) 


On the other hand, for a > 0, we have a’ > 0, so that 


nig m’, m-a< m’, (11.66) 
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and we conclude that (E,¢y) has a weight higher than ¢)¢ 
while (E_.¢,z) has a weight lower than ¢,y. Namely, EF, acts 
like a raising operator while E_,. acts like a lowering opera- 
tor for the weights. If we identify M = A, this would say 
that (Ea¢,) will have a higher weight than ¢, which would 
contradict the assumption that A is the highest weight vector. 
Therefore, for the highest weight vector A, we must have 


Eada = 0. 


Let us illustrate some of these ideas with the example of LD = 
su(2). This is a rank 1 algebra (namely, @ = 1, see (11.4)) with 
the unique positive root vector a. From (11.29) and (11.45) we 
note that we can write the commutation relations as 


(Hy, Fg] = +(a,a)E La, 


[Eo, Ba] = Ha, (11.67) 
so that if identify 


1 2 
Ha, Jz = Esa, (11.68) 


Jg = 


(a, a) 
we recover the algebra of angular momentum operators, namely, 
[J3, J4] = tJ4, 
[Jy., J_] = 2J3. (11.69) 


We note from (11.29) and (11.67) that any given positive root 
in a simple Lie algebra satisfies a su(2) algebra. In the case 
of su(2), as we know, J; and J_ are respectively the raising 
and the lowering operators and the highest weight state in any 
representation has the form |j,7), namely the state with the 
maximal projection m = j. 


For any weight vector M, as we have shown in (11.40), we have 
2(M, a) 
(a, @) 


Moreover, we also know that there exist ¢ fundamental weight 
vectors (Ay, Ag,:-- , Ag) satisfying 


2(Ai, aj) 


(aj, 05) 


= integer. (11.70) 


=6 5, if =1,2,---,% (11.71) 
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and the highest weight vector A of any finite dimensional irre- 
ducible representation can be expressed in terms of these fun- 
damental weight vectors as 


N=k,Ay + koAg +--+ + keAg, (1172) 


where ky, k2,--- ,kg are nonnegative integers. (Note that we 
did not use the symbol k’ here for conventional reason.) For 
explicit forms of A; in terms of the fundamental root vectors 
qa;, the reader is advised to look at the book by Humphreys. 


We remark here that given any weight vector M and any root 
vector @, we can define 


2(M, a) 


M, =M- 
(a, a) 


a, (11.73) 


which also defines a weight vector. The transformation M — 
M, is known as the Weyl reflection of M by the root @ (note 
that (Ma,a) = —(M,a)). The Weyl reflection is very useful 
in deriving several results. As a simple example, let us note 
that if we choose (in the adjoint representation) 


then, (11.73) leads to 


2(aj, 5) 
. @ : (a;,a;) : 


which is expected to be a weight vector. However, if (a;,a;) > 
0, the vector a;; is neither positive nor negative. Therefore, 
this implies that 

(aj;,a;) <0, (11.76) 
which, in turn, leads us to conclude that (see (11.48)) 


Ki <0, i Fj. (11.77) 


Without going into details, we simply note here that if we fur- 
ther use the Cauchy-Schwarz inequality, (11.50) results. 
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10. Second order Casimir invariant of L: Let us collectively 


denote (Hj, Eq, Eq) as X,,u = 1,2,--- ,dimL and set (see 
(8.71) or (11.2) as well as (3.117)) 
Juv = Tr (ad X,, ad X_), 


dim L 


i= S> gh Xi Xi: (11.78) 
HVv=1 


Then, as we have discussed in section 3.3 (see (3.118) and 
(3.120)), Ig defines the second order Casimir invariant of L, 
namely, we have 


[I2, Xa] =0, (11.79) 


for any X, € L. In terms of H; and Eq we can write this 
explicitly as 


L 
p=) 9 Hil; +) (EoB-o + E-oEa) 
ij=l a>0 


L 
= \° g@HjHj+25~ E-aEa t+ ~ Ha, (11.80) 


where we have used the fact (see (11.25)) that gaa = 1 so 
that gv—-* = 1 and g%’ = 0 fora+G 40. 


For the highest weight vector A, we have (see (11.61) and the 
discussion in the following paragraph there) 


Mion = Mba, Eads =0, a> 0. (11.81) 


This leads to 


£ £ 
S- g Hi Hyon = (S- J) NiAj)OA = (A, A)da, (11.82) 


t,j=1 ij=l 
as well as 
L e . 
SS Hadar = SO: a' Hi )ba = Hoe a’ Xi )Oa 
a>0 a>0 i=1 a>0 i=1 
= D(A, a) oa = 2A, 6) oa, (11.83) 


a>0 
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where we have identified the vector 6 as 


6=5 ya (11.84) 


a>0 


Using (11.81)-(11.84) in the definition of Jz in (11.80) we obtain 
Ing, = ((A, A) + 2(A, 8)) dy, (11.85) 


so that the eigenvalue of J, in the irreducible representation 
with the highest weight vector A, is given by 


Ip(A) = ((A, A) + 2(A, 6)) 
= (A+6,A +6) — (6,6). (11.86) 


Weyl] has shown that the dimension of the irreducible represen- 
tation ¢, with the highest weight A is given by 


I] (a,6 +A) 
Dim ¢, = Wes) (11.87) 


a>0 


11.2 Lie algebra Ay = su(é+4 1): 


We note right in the beginning that, for complex algebras which we 
are considering here, su(¢+ 1) ~ sl(€+ 1) and, consequently, the 
identification Ay = sl(€+ 1) has been made in some literature. But, 
we will continue with su(¢+1) recognizing that the two are equivalent. 
It is convenient to study the general Lie algebra su(@+1),@ > 1 which 
yields su(N) for N = €+1. The Lie algebra of the generators A’, is 
given by (see (3.55) and (3.56)) 


N 
AN, = 0, 
p=1 
[A*,, A%s] = 6% AM, - oN, ATs (11.88) 
for p,v,a,0 = 1,2,---,2@+1. We note from the commutation relation 
in (11.88) that [A“,, A¥,] = 0 for any pair of fixed indices yu, v. There- 
fore, we can diagonalize A!,, A%,,--- Apis simultaneously. (Actu- 


ally, there are only @ such matrices which are independent because of 
the first condition in (11.88).) 
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Let us next consider the su(2) sub-Lie algebra defined by the 
generators (see (3.57)) 
oe 


which satisfy (see (3.58)) 


Pee 2s ees or Ae es ee (11.90) 


We have already seen in chapter 3 that the eigenvalues of A‘, — 
A”, = 2J3 must always be integers. However, we note that by letting 
1 > 2 > 8, we can also look at the sub-algebra defined by 

and conclude that the eigenvalues of the generators A”, — A®, must 
also be mLegers. Since A“, is diagonal for any fixed p, it follows that 
(AM, — A“*! ) also commute for different values of js and there must 


w+1 
be a common eigenvector ¢ satisfying 


(A4,— AME b= md, w=1,2,---,4 (11.92) 


for integral eigenvalues A,,. Let ¢’ denote another simultaneous eigen- 
vector satisfying 


+1 
(At, — AM) =m,¢. (11.93) 
Let us assume that the two sets of eigenvalues M = (m1, m2,--- , me) 
and M! = (m{,m,--- ,m/,) are such that they coincide except pos- 


sibly for one (or more) component which satisfies mn41 > mi,,, for 
some value of n. In this case, we say that the weight M is higher than 
the weight M’ and denote this by M > M’ and also ¢ > ¢’. Among 
all possible weights M, there exists the highest weight denoted by 
(see discussion on weight vectors in section 11.1) 


A= (Ai, A2,°°° Ae); (11.94) 
satisfying 
(AM, — AMT on = Anas (11.95) 


such that A > M for any other weight vector M (we are discussing 
only finite dimensional representations here). As we will show shortly, 
the highest weight A is unique for any irreducible representation. We 
note that any A,, takes nonnegative integer values, namely, A,, > 0. 
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To keep the discussion parallel to section 11.1, we note that it 
is more convenient to identify 


Hy, = A",— AYN, w=1,2,-- 6 (11.96) 


so that we can write (11.92) as 
Hyd = Myd, (11.97) 


for any eigenvector ¢ with weight M = (mj ,mz,--- ,mzg). 
Using (11.88), we next note that 


[Hy, A%] = (AY, — AMT, A%] 


= (54,AM, — 64,A%,) — (S% 41 AMg — AM 11) 


a a f +1 Qa 
= (5%, — 8 — My +B") AS. (11.98) 


Therefore, for the highest weight state, we have 
Ay (A% ba) = ([Hps A%] + A%G Hy) On 
= (5%, ee een ae ru) (A% ba). (11.99) 


We note from (11.99) that if w= a < 6, then we have 


H,,(A% ba) = (1+ 6 + Ay) (AX; a) (11.100) 


so that, for B > py, the eigenvector (A°%3 ¢,) has a weight which 
is higher than A itself. This wil be incompatible with our starting 
assumption that A defines the highest weight. Consequently, we must 
have A” B oa =0 for 6 > p. Since p is arbitrary, this implies that 


A% ¢, =0, a< 8B. (11.101) 


Since AX, < 6, raises the weight of the state, we call this 
the raising operator. Similarly, it can be seen, from (11.99), that 
A“, a > 6 lowers the weight of the state and, therefore is called the 
lowering operator. For example, for uw = 8 < a, (11.99) leads to 


Hp(A% $a) = (1 — 6%41 + Ap) (A% ba), (11.102) 


so that A°%;¢,,a > 8, corresponds to an eigenstate with a lower 
weight (than ¢,). We have already seen in chapter 3 that the oper- 
ators J, = Al, and J_ = A?, correspond respectively to raising and 
lowering operators of su(2). 


300 11 ROOT SYSTEM OF LIE ALGEBRAS 


We will next show that the highest weight vector ¢, is unique 
in any irreducible space and that the irreducible representation is 
completely specified by its highest weight vector as follows. Using 
lowering operators A%,a@ > 8, we can construct successively from 
the highest weight vector states such as 


oa, A% oa (a > B), AX AG, ba (01 > B1,02 > Ba), 
ibis) 


Let Vo denote the vector space spanned by all these vectors obtained 
from the highest weight vector through the application of the lowering 
operators. Let us assume that Vo corresponds to a sub-vector space 
of the irreducible space V. Apart from the vector ¢,, the rest of the 
vectors in the space Vo have weights lower (smaller) than the highest 
weight A. Therefore, we have 


A% Vo GV, for a> 8B, (11.104) 


since in this case A®%, corresponds to a lowering operator. Similarly, 
H,,Vo © Vo since each vector of Vo is a weight vector. Therefore, we 
need to prove 


A%Vo C Vo, (11.105) 


even for the case of raising operators for which a < £. 

We have already shown in (11.101) that A%;¢, = 0 C Vo for 
a < 8. Clearly, any number of raising operators np plied to the highest 
weight vector will also lead to the null vector which is contained in 
Vo. We next note that, fora < 8,u>V, 


As Cs Pa) = [A%s, AM eat Ar “B ba) = [A%, AY] ON 
(5%,4% = 54, A", | eve (11.106) 
(Ifa < Gand p < v, they are both raising operators and the left hand 
side trivially belongs to Vo as argued above.) The final identity follows 
because if the right hand side in (11.106) contains a raising operator, 
it will be trivially contained in Vo. However, if the right hand side 


contains a lowering operator, it will be contained in Vp because of 
(11.104). Similarly, we can write, for a < 8, u; > 44,1 = 1,2, 


A% (A AM Ae OA) = [ A AM? ox + AML Ae (A% oa) 
= [A%, AMD, J AM, ba + AMD, [A%, AM] G0 S Vo. (11.107) 
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This follows because if the commutator [A%, Afi] in the first term 
contains a raising operator, then we can reduce it further using (11.106) 
to show that it is contained in Vo. If it contains a lowering operator, 
it is also contained in Vo by (11.104). Similarly, if the commutator 
[A%,, A“?,| in the second term contains a raising operator, it is triv- 
ially contained in Vo while if it contains a lowering operator, it is 
contained in Vo by (11.104). This procedure can be generalized to 
a product of the form A%Q(AD, ... Al? oa) to show that it is also 
contained in Vo. Therefore, (11.104) and (11.105) together show that 


A% Vo © Vo, (11.108) 


for any A®; € su(é+1). (We note here that the elements of the 
form A®%, are diagonal, as we have mentioned earlier, and leave the 
space invariant.) Therefore, we have shown that Vo is an invariant 
subspace of the irreducible space V. Then, by the definition of the 
irreducibility of a space, we must have Vo = V since Vo 4 0 (namely, 
since ¢, € Vo). Furthermore, this implies that the irreducible space V 
is completely determined by its highest weight vector ¢, by applying 
the lowering operators. Therefore, it must be unique since any other 
highest weight vector, obtained by a different lexicographic ordering 
of 1,2,--- ,@+ 1, must lead to the same space. In order to make 
contact with the method of Young tableaux, we recall from (3.54)- 
(3.55) that we can write the eigenvalues of A’, in the form 


+1 


1 
11.109 
Ay, 74 d ay, ( ) 


for some integers a). This follows from the structure (3.54) as well as 
because (A", — A“ a 1) has integer eigenvalues and that 7, = 
(see (3.55)). Therefore, for the highest weight vector, we can write 


AM on = fuda, (11.110) 


where we have identified 


é+1 


fu=taR- eal df (11.111) 
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with the parameterization of f,, in the form 


fi = Av +AQ+AR4+++++ Ae + Avg, 

fo = Ag FA Pet te + Aga; 
(11.112) 

fe = Ae + Aeqi, 

fei = Att. 
We note that the reason for parameterizing f,, in this manner is to 
have a natural ordering in the eigenvalues, namely, f; > fo >--- > 
fe+i as well as to have f,, — fu4i = Ay, namely the components 


of the highest weight. We note from (11.92) or (11.94) that the 
weights have only € components while in (11.112) the eigenvalues 
have (€+1) components. This has its origin in the fact that although 
there are (€+ 1) diagonal generators, only @ of them are independent 
because of the traceless condition (3.55). Therefore, we can think of 
New, = —(Ar+A2+-+:+Ac¢). Furthermore, because the eigenvalues in 
(11.111) are invariant under an arbitrary constant shift f,, > f,+A, 
we can always choose Ap, = 0 if we wish. 

For the tensor representation of su(N) (N = €+ 1) with the 
Young tableaux index (fi, fo,---,fn) with fy > 0, the highest 


weight vector ¢, can be identified with the tensor 


ON — Deg oct 22-..2-+» NN---N* (11.113) 
ed 
fi fa fn 


For example, let us compute the eigenvalue of the second order Casimir 
invariant of su(V) 


N 
k= So AM AY, 


pv=1 
N N 
= S>(AM,)° + S> (At, AY, + AY, A). (11.114) 
p=1 U<v 


Using the commutation relation (11.88), we note that we can write 


N N 
> AL AY, = >) (AY AY + [44 4%) 
U<V P< 
N N 


=> AY AB, + > (5% AH, — 64,4”) 


U<v U<v 
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N N 
=> AAU +> (AX = 


U<vV LCV 
N 
=A + DUN way DoD 
U<vV v=1 
N 
= YA + 0v —2+1)A (11.115) 
u<v pol 
Substituting this into (11.114) we obtain 
N N 
Ip=S_ ((A4,)? + (N — 24+ 1)A4,) +250 AY AM. (11.116) 
p=1 p<v 


As we know, the highest weight vector satisfies (see (11.101)) 
AY, ba = 0, MSY, 
so that we see from (11.116) and (11.110) that 


N 
Thon = D> (Chu)? + (N — 20 + Di) bn- (11.117) 


p=1 


As a result, we obtain the eigenvalue J(p) for the particular repre- 
sentation (given by the highest weight vector ¢,) to be 


n= (4+ 83 ) ae H) 


p=1 p=1 
N 
ie N 
= S° (G4) - o Ae) (11.118) 
p=1 
where we have identified 
Se N+1 epee 
acl oe er eae fe | ar, mn ee (11.119) 
A=1 


which satisfies 


N 
SoG, = 0. (11.120) 
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This reproduces the earlier result (see (10.148) and the first relation 
in (10.150)) except for an overall normalization constant N. 

Let us recall that the generators A’, of su(€+ 1), for up < v 
are known as raising operators. However, among all such raising 
operators, there exist some very special raising operators given by 
Aye APs Ae pc AS a The reason is that all other raising operators 
can be obtained from these through the commutation relations of 
su(@+ 1). For example, we note from (11.88) that we can write 


Als = [A’y, A%3], Aly = [A’, 4°] > [[A"s, A], A°u], (11.121) 


and so on. We call these @ raising operators to be simple. Let us 
next introduce (€+1) orthonormal unit vectors e),e2,--+ ,e¢41 in an 
(€+ 1) dimensional Euclidean space satisfying the inner product 


(e;,e%) = jn, Jk=1,2,--- 041. (11.122) 
Let us also define @ independent vectors from these as 
Q,=e,—e2, A2=e2—-6e3, +++, ag=ec—egyy. (11.123) 


We call a 1,@2,---,@~ to be simple roots of the Lie algebra (see 
discussion on simple roots in section 11.1). Let us next identify 


—~.-— A jt+l 
Hew A, 


Bee As, (11.124) 


where j,k = 1,2,---,@+1. (We may set Hpi; = 0.) The second 
definition in (11.124) leads to 


A’ = Pas; Ate = Eas, ponies) Avia = iy, 

AS Bag. Ape E ay “mag ASH 2 ae. (105) 
while we have 

AX = Baw: Ary Tos Ae eee, (11.126) 


and so on, which can be compared with (11.121). Let us make the 
identification 


Qik = €j — ek, (11.127) 


and call it a positive or a negative root depending on whether 7 < k 
or 7 > k. Recall that any positive root can be expressed as a linear 
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combination of simple roots a1, @2,--- ,a@¢ with some nonnegative 
integer coefficients (see (11.44)). 

From the commutation relations in (11.88), we can now find 
relations such as (see (11.29) and (11.32)) 


[Eo Eg] = Nag Eatg, forat+ 640, anda+ a root, 
[Pag =a] = He, 
[Ha, Eg) = (a, B) Eats, for a+ a root. (11.128) 


Figure 11.2: The Dynkin diagram for the simple roots of Ag = su(@+ 
1). 


From the definitions in (11.122) and (11.123), we see that 
(a;,a;) = 2, GS 12 SOL, 

(aj, aj41) = —1, 

(a;,a%) =0, if |j—k| > 2, (11.129) 


for simple roots. These relations are generally depicted by the Dynkin 
diagram as shown in Fig. 11.2. 

Finally, we simply introduce the notion of the fundamental weight 
system (Aj, Ag,--- , Ag) given by 


+1 


Ar = e1—zy > ee, 
. o 
Ay = ey+e2.- py ek, 
CHT 2 (11.130) 
7 , 
Ay = ey reg ts +ee— ay Do ek; 
k=1 
which satisfy 
2(A; 
2(Aj, OK) _ ky f= 1,2,-°+ 8 (11.181) 


(aK, A) 
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If we replace e; by a; in (11.130) using (11.56), we can also write 
bli 
J e4+1 
=a 
fd 


(ay + 2a9 +--+ + ja;) 


+ ((2— j)ajsi + (2-9 —Lajpa +++: +a). 
(11.132) 


With this, one can show, using the general Cartan-Dynkin theory 
(see section 11.1), that the highest weight A of an irreducible repre- 
sentation characterized by A = (Aq, A2,--- , Ag) is given by 


A = dy Aq + AnpAg +--+ + ApAg, Aj > 0. (11.133) 


11.3 Lie algebra D; = so(2¢): 


As we have discussed in (3.23) and (3.24), the Lie algebra of so(NV) 
is given by 


Jik = —Ikj, Gh Ss Ue 2 00 oN, 
Bite Jom = —0;¢Jkm = SkmI je + OjmIKe + Ones jm: (11.134) 


This will describe the Lie algebra so(2¢) with the identification N = 
2¢. However, this form is not suitable for the discussion of its irre- 
ducible representations. Therefore, we change the basis of generators 
from Jjz,j,k =1,2,--- ,2€to aset of linearly independent generators 
consisting of R“”, Ru, and X‘, with u,v =1,2,--- ,¢ defined by 


1 : Vv 
RO = 5 (Sue — Iutrev40) — US +e + Jutey)) = —R, 
1 : 
Ry = 9 Cire = Jutev+e) + AC ieee, a Jui) = — Rup, 
1 ; 
Ki (Gnd ay ee Ded? (ATA85) 


2 
Relations (11.135) can also be inverted to obtain 


Iw = ; (RNY Rw +X", X",) 5 


1 
Ju tev+e > 9 (RY + Ruy i ea -2 x) ’ 


(RY + Ruy — X" 


a 
9 a) ? 


Jutty = 5 (RY Rye eX) (11.136) 
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for p,v=1,2,---,2. 
The commutation relations for the new basis can now be ob- 
tained from (11.134) and take the forms 


Bore X% = 5° X", = OAXS, 
Do der Soa Ree OF ROM 
X",, Rag] = —0", Rug — 5 4Rav, 


Ruy, RP] = 6%, X8, + 6° X% — 6% XP — 68 Xe, 


Rushes =0= (RR; (11.137) 


for p,v,a,8=1,2,--- ,2. 

The advantage of the transformation (11.135) or (11.136) can 
be understood as follows. Since so(2N) generates rotations, the gen- 
erators J; have the coordinate representation given by 


) 0 
Se cee eee eerie k= 12.2.--- 26. 11.1 
“I Oa, "Fa," jk 94) ; id ( 38) 


Tik 
If we now introduce a complex coordinate z, through the relation 
Zp= fy tire, P=1,2,--- 2h, (11.139) 


then, we can express 


0 ) 
Ruy = “Hare = Y Oz 
0 0 
Re — —2z* Zz : 
¥ Oz, Oy 
Cae SS Zp 2 oe (11.140) 


Zz 
Le ’ 
OZ Oz* 


in terms of the new variables. Here “star” denotes complex conjuga- 
tion. 

We note (from the first relation in (11.137)) that the genera- 
tors X', define a Lie algebra u(@). We also recall that the unitary 
representation of so(2) requires 

Th = Sig = Tie (11.141) 
This determines that 


a ig) She. (RA Shia (11.142) 
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for a unitary representation. Let us note that, for our discussion, 
we can restrict ourselves to the case ¢ > 4 for the following reason. 
The representation for so(2) corresponding to @ = 1 is trivial (it is an 
Abelian algebra) and for @ = 2, we note that so(4) = su(2) © su(2). 
Indeed, for @ = 2, the two commuting su(2) algebras can be identified 
with the generators 


(i) : X), — X%, X"y, Xi 
(i) 2 XK RY Ra. (11.143) 


For the case of @ = 3 or the algebra so(6), we can define 


3 3 
1 1 
AM = XLS XA, A= TM, 
A=1 A=1 
eee ie 
At = 5 St ae AN =5 Se? Rag, (11.144) 
a,b=1 a,b=1 


with p,v,a,8 = 1,2,3 and #7 = Euag Aenoting the Levi-Civita 
tensor in 3-dimensions. These generators can be seen to satisfy the 
algebraic relations 


4 

ya =0, 

a=1 

[A%, A“ = 6%,A%) = JAG; (11.145) 
for a,b,c,d = 1,2,3,4. We recognize this as the su(4) Lie algebra 
and, therefore, we conclude that so(6) = su(4). Therefore, these 
cases need not be studied again. 

Let us assume that all the generators belong to some dx d matrix 


representation of some irreducible space V = {p}. First we note from 
(11.137) that 


[xe XY) =0, (11.146) 
for p,v = 1,2,--- ,éso that if we identify 

Ay, = X",, (11.147) 
we can write 


[H.., Hy] = 0. (11.148) 
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These define € commuting generators and, therefore, can be simul- 
taneously diagonalized. We will now show that the (simultaneous) 
eigenvalues of these generators are either all integers or half integers 
depending respectively on whether we have a tensor representation 
or a spinor representation. 

Let us recall that the generators 


Jg = 5 (X-X%4), IE =X, J-= X4, (11.149) 


satisfy the familiar su(2) algebra so that the eigenvalues of X1, — X%, 
is always an integer. Let us also note from (11.137) that 


[Riz, R'] = —6',X%, — 6X}, = -(X1,4+ X4), 
[Xi +X4,R¥)=2R", [X14 + X24, Rie] = —2Rie. (11.150) 


As a result, we can identify 
1 
i= 5 CPAs) 5 Se I Has (11.151) 


with the generators of a second su(2) algebra. We conclude from 
this that X+, + X%, also always has integer eigenvalues. Since X1, — 
X?, as well as X1, + X%, have integer eigenvalues, it follows that 
both X14, and X%, can have either integer or half integer eigenvalues. 
Carrying this analysis for w = 1,2,--- ,€, we conclude that each of 
the diagonal generators X", must have simultaneously integer or half 
integer eigenvalues. 

We can now introduce the notion of highest weight vector as in 
the last section (for su(@+1)) given by 


Hyon = OO (11.152) 


where f1, fo,--- , fe are now either all integers or half integers. More- 
over, we can now parameterize the eigenvalues in an ordered manner, 
as in the case of su(@+ 1), such that 


Ji2 jee f52 ee | Fi, (11.153) 


as we will shortly in (11.161). The root system for this case is a bit 
more complicated. We note from (11.137) that the (¢— 1) operators 
an XA, vee ae are raising operators just as in the case of su(@+ 
1). In fact, we see from the commutations relations (see the second 
relation in (11.137)) that R“” s are also raising operators. As a 


310 11 ROOT SYSTEM OF LIE ALGEBRAS 


simple root system of so(2@), we can choose the ¢ raising operators 
to correspond to 


Kop ak hee (11.154) 


In analogy with the su(¢+1) case, we introduce @ unit orthogonal 
vectors €1,€2,:-: ,e¢ in an ¢-dimensional Euclidean space (see also 
(11.122)) and write 


XS = Fie e)s Lb - V, 
RW = Ele. ves )5 oY, 
Rw = dy ek B5y WK. (11.155) 


Defining, as in (11.123), 


OQ, =e€1—e€2, O2=e2—€3, +++ ,ag1 =e¢_-1—ep, (11.156) 
but with 
ae =eg_1 + &g, (11.157) 


they constitute the desired ¢ simple root system of Dg = so(2@) with 


Fa, = ber b= 1, 2, nes jl - ale and Eas = ROU. 
(11.158) 


Correspondingly, we define 

A= fi- fe, Av=fo—fa, +++ Ae1=fe-1—fe, (11.159) 
as in (11.112), but with 

Ae = fe-1t fa, (11.160) 


so that all components of A = (Aj, A2,°-+ , Ae—-1, Ae) are non-negative 
integers. Explicitly, we can write (compare with (11.112)) 


fr = AptAg+Ag 4-05 + Aga + F(Ae-1 + AL), 
fa = do + AZ +++ + Ata + 5(Ae-1 +2), 
(11.161) 
fea -= Ne-2 + $(Ae-1 +d), 
fri = (Ac-1 + Ag), 


NIF bwle 


o> a (Ac — Av-1), 
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Qe-1 


ae 


Figure 11.3: The Dynkin diagram for the simple roots of Dg = so(2¢). 


which are all integers or half integers satisying (11.153). The corre- 
sponding Dynkin diagram is depicted now as shown in Fig. 11.3. 
The dimensional formula for the irreducible representation {p} = 


(fi, fa,--- , fe) is given as follows. First we define 
N 
Cn= fat 5 B= lett pe hens (11.162) 


which satisfy 0, > 02 > +--+: > o¢-1 > |og|. Then the formula for the 


dimension can be written as 
9f-1 £ 


d(p) = (2!)(4!) --- ((2@ — 4)!) 


((0;)" — (ox)?) , (11.163) 
j<k 


which is due to Weyl. 
We can also compute eigenvalues of symmetrical orthonormal- 
ized Casimir invariants [,(p) to be 


Fp) = (on)? - 5g N(N -1)(N -2), 
p=1 
£L £L 4 
Ty(p) = 5 (N?-N +4) So (oy)*- 2N-1) | 90 (4)? 
p=1 wal 
— (N —3)(N —8)(N? — 1)(N? — 4), 


L L 
- 5 (aN -1)(N?- 1 +8) ( ,)? (deo 
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L 
+ 7(N? — N—2) 3” ((o,)°)° 
p=1 


3 
= = @ N(N —5)(N — 32)(N2 — 1)(N2 — 4) 


x (N? — 9)(N? — 16), (11.164) 


with a suitable normalization. Of course, we have to identify N = 2¢ 
in (11.164). However, we note that the same result for the Casimir 
invariants holds even for so(2¢+ 1) with N = 2@+ 1 if we define o,, 
suitably as is discussed in the later section. 

As an example, let us consider the tensor representations with 
signatures {p} = (fi, fo,--: , fe) = (1,1,--- ,1) and (1,1,--- ,1,-1). 
They are realized as completely anti-symmetric tensors Tj, ,,...4, of ¢ 
indices which take values over 11, M2,°+: , We = 1,2,--- , 20, such that 
the representation is self-dual for the former and anti-self-dual for the 
latter. Namely, 


* van ee 
Type we = Eur pamevive-velyyveve = ET yy panos (11.165) 


where the star denotes the dual operation. For the spinor repre- 
sentations (a0 ,3) and (507° the they are realized as 
2&1 x 2! matrices (see section 6.3, in particular, (6.100) as well as 
(6.77)-(6.78)) 


1 
Tok ri (a oxo!) : 
7, 1/04 i 11.166 
Jik = ZT — FROG) » (11.166) 
where j,k = 1,2,--- ,2¢ and the matrices 0; and a} satisfy 
00} + ono! = 26;,1 =o! to, 11.167 
GO + OKO; = 20;n 1 = Oj 0K + 94,05, (11.167) 


as discussed in chapter 6 on Clifford algebra where the Dirac-Clifford 
matrix is written as 


0 oj 
w= | soe, pS Dee (11.168) 


a, 0 


Finally, we note that all even dimensional Casimir invariants 
[,(p) do not distinguish between the two representations (f1, fo,--- , fe) 
and (fi, fo,--- , fe-1, —fe) if fe 4 0 since then it follows from (11.162) 
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that og = fe. However, so(2¢) does possess an ¢-th order Casimir in- 
variant Ip(p) given by 


it 20 
Ip = (20)! Se Evi pave-peve I pry Jnigv2 BS Tupre- 


* prvi peve=1 


(11.169) 


It is known that its eigenvalue is given by (0, is defined in (11.162) 
and are not the matrices discussed in (11.166)) 


Ie(p) = o102--- 08, (11.170) 


which changes sign as fy > — fe. 
The highest weight vector can be written in the form (see also 
(11.133) 


A= fier + foeat--++ free 
= \y Ay + AgAg +--- + AgAg, (11.171) 


in terms of the @ fundamental weights A;, Ag,--- , Ag given by 


A,=e1, Ag=e,+e2, ++: ,Ag2=e1 +e2+--++ e429, 
(11.172) 
and 
1 
Ap-1 = 5 (ey + eg +--+ + e¢_2 + ep_1 — &), 
1 
Ayg= 3 (ey +eg +++: +ep_9 + e¢_1¢44 e;) : (11.173) 


11.3.1 Dy = so(8) and the triality relation. In order to explain the 
triality relation, let us first introduce the notion of inner and outer 
automorphisms of a Lie algebra. For any Lie algebra L of dimension- 
ality M and with the commutation relation 


M 
be. S Le one. se (11.174) 
A=1 


the transformation 


X, > Xi), =9Xyg €L, (11.175) 
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where g is an arbitrary element of the associated Lie group g € G = 
exp(L), is clearly a symmetry of (11.174). Namely, it satisfies (recall 
the case of Yang-Mills gauge transformation) 


M 
Behe. Gil ai Cree. Se (11.176) 
A=1 


Such a symmetry is known as an inner automorphism of the Lie 
algebra. In contrast, when transformations X, — Ai preserving 
the Lie algebra structure cannot be written in the form (11.175), 
then such transformations are known as outer automorphisms of L. 
Therefore, the outer automorphisms are unanticipated and special. 

For example, consider the Lie algebra of su(V) described by (see 
(11.88)) 


N 
> AM, =0, [A4,, A%] = 6%.AM, — 5H, A%,. (11.177) 
p=1 


In this case, we note that the transformation 
At, > Alt = — AY, (11.178) 


preserves the Lie algebra structure (11.177). Namely, the new gener- 
ators satisfy 


N 
do 4% = 0, [A AS] = 054" — 54,49. CEA) 
pw=1 


However, the transformation (11.178) defines an outer automorphism 
of the su(V) Lie algebra for N > 3, while it is an inner automorphism 
for su(2). This can be easily seen as follows. Consider a tensor 
representation of su(N) of the form T1020". With respect to the 
transformed basis (11.178), the tensor representation transforms to 
its complex conjugate 

THba in TIM DULY 2 Ym (Ee ee). . (11.180) 


Y4AV2Q°-Um YAV2°-Um Hiba in YjV2°-Um 


Since the complex conjugate representation of su(N) is an inequiva- 
lent representation for N > 3, the transformation (11.178) cannot be 
an inner automorphism. On the other hand, for su(2), any irreducible 
representation is real or pseudo-real so that the complex conjugate 
representation is an equivalent representation and correspondingly, 
the transformation (11.178) is an inner automorphism in this case. 
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Let us next consider so(V) Lie algebra given by (see (11.134)) 


Tin = —Ikj, 

Watrlecnl = Ogden — Sandal Oper + Omed as (11.181) 
where j,k, @,m = 1,2,---,N. It is known that any permutation o 
of the N indices 1,2,--- ,N gives an inner automorphism of so(N) 


corresponding to the symmetry group Sy of N objects (see section 
1.2.1). In contrast, any symmetry of the Dynkin diagram of a Lie 
algebra is known to give an outer automorphism of LD. Let us con- 
sider the Lie algebra D4 = so(8) for which the Dynkin diagram is 
shown in Fig. 11.4. It is clear that this diagram is invariant un- 
der the symmetric group $3 of interchanging the three simple roots 
Q1, 03,4 (leaving a2 fixed). As we will see shortly, this leads to an 
interesting result that, for so(8), there is no essential difference be- 
tween the vector and the spinor representations, both of which are 
eight dimensional. 


Figure 11.4: The Dynkin diagram for the simple roots of D4 = so(8). 


To see this, let us make a transformation 6 given by 


4 
1 
RY ais Riv = =) eavapie”, 


a,B=1 
6 i 
a cea al (11.182) 
a,p= 


for evs= 1,923, 4. Here e070? = Eyvag denotes the completely anti- 
symmetric Levi-Civita tensor in 4-dimensions. It is now straightfor- 
ward to verify that the commutation relations (11.137) are preserved 
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under the transformations @ given in (11.182). We will see shortly 
that 6 denotes an outer automorphism of D4 = so(8). 

Let P denote an inner automorphism of so(8) which interchanges 
the indices 1 + 4 and 2 + 3 and define 


tT = P#, (11.183) 


so that we have 


A=1 
> ee ai 
alae peer ee 
at 
ie ER ee 
Rw > 5 Serer Res (11.184) 
a,B=1 


where we have identified ; = 5 — » so that 1 = 4,2 = 3,3 =2,4=1. 
Finally, we introduce another outer automorphism 7 defined by 


1. For w# 4, F 4: 


m(X,) = reap m( REY) = As m(Ryv) = Ryv- 
(11.185) 


2. For w= 4, #4: 


m(X4)=Ry, 1(R”)=R”, 2(Ry) = X4,. (11.186) 


3. For p#4,v = 4: 


™(X"4) = —R!, m( RM) = =X, m(Rya) = Rya. 
(11.187) 


a(X%,) =—-X4, (11.188) 


where we recall that by anti-symmetry R*4 = 0 = Ray. 
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Let ¢o be the highest weight state of an irreducible representation 
{p} specified by (f1, fa, fs, fa). Then, we can verify that 7 defined 
by (11.185)-(11.188) leaves ¢o invariant, but 


fof GFA), fs > -Sa, (11.189) 
which implies 
o; > 0; (j #4), o4 *} —04, (11.190) 


where we have identified o,, = f,,+4—p,u = 1,2,3,4. It follows now 
from (11.159)-(11.160) that under the operation 7, 


3 > Ma, (11.191) 


while A; and 2 are unaffected. In other words, 7 interchanges the 
simple roots a3 © a4 in the Dynkin diagram Fig. 11.4. 

Similarly, the action of 7 defined in (11.183) induces the trans- 
formations 


TT oo 1 
fiwh= ait At fa— fa), 
y: ae 1 
fo > fe= ait f— fst fa), 
= 1 
if f= 5h 1—fot+fs+ fa), 
r 1 
hoa fia a it area) (11.192) 
or equivalently 
y ee 1 
01 +01 = 5 (01 +02 +93 — 94), 
7 1 
02 + 82 = 5 (01 +02 —93 +4), 
1 1 
03 +03 = 5 (91-92 +93 +04), 
7 1 
4 + 04 = 5 (— 01 +02+03+04). (11.193) 


Equation (11.193) can also be equivalently written as 


ee eg 


i 5 (91+ 2 + 93 + 94) — 05-p, (11.194) 
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which preserves the ordering 


01 > 52 > G3 > |G4I. (11.195) 
In terms of A,,s, this can be written as 
A & ds, (11.196) 


while leaving A2, 44 invariant. Therefore, the operation 7 implies the 
permutation of the two simple roots a; and ag in the Dynkin diagram 
Fig. 11.4. 

It is clear now that we can identify the 3! = 6 elements of the 
symmetry group $3 (see equations (1.13)-(1.16) in section1.2.1) asso- 
ciated with the Dynkin diagram Fig. 11.4 as 


1. P, = 1 = identity, 

2. Po =7:03 6 4, 

3. P3=T: 0a, 6 a3, 

4. Py = ath =TIT: 0, 6 a4, 

5. Py =T7 2A, 7 03 4 a4 > AY, 
6. Po =17T: a, 9 a4 9 a3 > Q4. 


Especially, from (11.189) and (11.192) we have 


1 + (1 
W ( ,0,0,0) ( ,0, 0,0); (G95) 5) 5) (57575 5) 
Lt 
1 a rents ec 
7: (1,0,0,0) + (5,555.-5)3 
lili 111i 
. 11.1 
ish 5% 53'S) (575 525) ( 97) 


In other words, three 8-dimensional irreducible representations, cor- 
responding to a vector, a spinor and a conjugate spinor, transform 
among themselves under the symmetry group $3. This is the well 
known manifestation of the triality principle of D4 = so(8). We note 
that the dimension d(p) in (11.163) is invariant under 54 (the per- 
mutation group of four elements) and so is the second order Casimir 
invariant J2(p) in (11.164) since 


Oo, oe tos bolo, +o; bes oy (11.198) 
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as we may easily verify. However, this is not the case for I4(p) in 
(11.164) as well as for I4() in (11.170). For example, using (11.190) 
as well as (11.193) we obtain from (11.164) and (11.170) that 


m: Iy(p) > Lap), La(p) > —Li(p), 


2 2 
as 1 ie 
Ia(p) > = 5 Tap) +5 La(p). (11.199) 
30 2 
Here we have used the relations 
2 
4 ic 3 [2 
= NA 4 2 
So (Gu)* = ay > maar So (on)? | —6o1020304, 
p=1 p=1 p=1 
2 
ie i ae 1 
Bees ee 4 
71020304 = <3 So (on) + 16 rw + 5 01020304. 
w= wal 
(11.200) 


It is clear from (11.199) that I4(p) and I4(p) behave as a two-dimensional 
irreducible representation of the symmetric group $3. In contrast, the 
6-th order Casimir invariant Jg(p) in (11.164) remains invariant under 
S3 by the identity 


p=1 p=1 v=1 
4 4 4 

=45 “(o,)° —5'| > (oy) (S10) , (11.201) 
p=l pw=1 v=1 


This is indeed expected since the sixth order Casimir invariant of 
so(8),namely, Ig(p), is unique. We note that our arguments show 
that the permutation symmetry group 53 is an outer automorphism 
of the Lie algebra so(8) since any inner automorphism will preserve 
any p-th order Casimir invariant I,(p). (Reference: S. Okubo, J. 
Math. Physics 23, 8 (1982)) 


11.4 Lie algebra By = so(2¢ + 1): 


We note here that the Lie algebra De = so(2¢) studied in the last 
section is a sub-Lie algebra of Be = so(2¢+ 1). The number of 
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generators in By correspond to ((2¢+ 1) while that in Dz is ¢(2@—1). 
Therefore, as in the case of De = so(2@) (see (11.134)-(11.137)), we 
again change basis of the Lie algebra from Jj,,7,k =1,2,--- ,2@+1, 
to X',, RM’, Ruy, u,v = 1,2,--- ,£. The remaining 2¢ generators are 
identified with 


1 . 
By = 5 (Sure2e+1 — tJ y2041) 5 


1 : 
BY= 5 (Su4e,2e41 oF tJy20-41) » b= 1, 2, aa oe (11.202) 


In this case, therefore, in addition to the commutation relations given 
in (11.137), we also have commutation relations involving B, and B". 
These can be obtained from the definitions in (11.134) and (11.202) 
and are given by 


X#,, By] = —6%, By, 
Xt B*) = 6%, B", 
Re, By] = 64, BY + 6%, Be, 


Ry B |= 04, By = 05 Ba 


RY B20 = |B Bil 

Bu, By] = —2Ryv, 

BY, BY] = 2RHY 

B,, BY] = —2X”,. (11.203) 


The unitarity condition for the irreducible representation re- 
quires that 


(B,)' =B", (BY) =B,. (11.204) 


Furthermore, we note that the mutually commuting Cartan sub- 
algebra still consists of the generators H, = X‘,,u = 1,2,--- ,2. 
However, in this case, we note from (11.203) that the generators BY 
also act as raising operators. On the other hand, we note that the 
set of raising operators X fa XxX a 1 SX a ' and B* can now generate 
all the raising operators. For example, we note from (11.203) that 


1 
Role — 3 [BoB = = (xe, BB, (11.205) 


1 
2 
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and so on. Therefore, the set of raising operators 
l-— 
Ky Koei eB (11.206) 


can be thought of as fundamental and the simple root system will 
change when we take this fact into account. 

As before, we introduce @ unit orthogonal vectors e1, €2,--- ,e¢ 
in an é-dimensional Euclidean space and write (see (11.155)) 


AY = LG, =6,)3 LAY, 

RW = Ween MSY, 

Rig ie oes u<y, (11.207) 
as well as 

BY =F, By = E-e,- (11.208) 


Correspondingly, the simple root system (see (11.156), (11.157) as 
well as (11.206)), in this case, is given by 


a1, =e,—€2, Ag=&€2—€3, ++: , Me] = €f-1 — €@, 
ae = eg. (11.209) 


As a consequence, the fundamental weight system is given by (com- 
pare with (11.172) and (11.173)) 


A, =e), Ag =e, +e, --- ,Ag-1 =e1 +e2+---+e¢-1, 
1 
Ay = a ieish Coren): (11.210) 
The highest weight vector can now be written in the form (see (11.171)) 


A= frei + foeo +--+ + free 


= Aah F Aaachs oe AEG, (11.211) 
with 
fir = ArtrAgt Aa +--+ Aga + bry, 
fr = Not gt Pye oy, 
(11.212) 
fa = Ne-1 + § Xe, 


fe — 5 Xe, 
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The fes are again all integers or half integers satisfying 


he foes 2 fis fee 0. (11.213) 


We note that fp does not take negative value in this case consistent 
with the fact that so(2@+1) does not have any self-dual representation 
in contrast to so(2@). 

As in (11.162), we define 

N 20+ 1 

Gu tat US far ge (11.214) 
with w = 1,2,--- ,@ With this, the p-th order Casimir invariant 
of so(2@ + 1) can be checked to have the same form as in (11.164) 
for so(2¢), but with N = 2¢+ 1. However, Weyl’s formula for the 
dimension of a representation now has the form (see (11.163)) 


2 u 
de) = Gyan ((2@ — 1)!) Il, He — (o,)") 


Figure 11.5: The Dynkin diagram for the simple roots of By = so(2@+ 
1). 


The Dynkin diagram for By = so(2¢+ 1) is shown in Fig. 11.5. Here 
we have represented the root ay = eg (see (11.209)) by a black vertex 
(dot) to distinguish it from the other roots since (az, az) = 1 while 
(aj;,a;) = 2,7 # &. Furthermore, conventionally the number of lines 
(bonds) connecting two neighboring roots a; and a;+1 is determined 
as (see (11.54)) 


2(aj, Oj41) 


V (25, 5) (Qj41, 0541) 


Ny.j+1 = = Nj+1,5- (11.216) 


Therefore, it follows that the number of lines (bonds) between the 
roots @g_; and ay in Fig. 11.5 is given by 


2(ap_1, Oz) 


J (ae-1, Op_1) (Qe, Qe) 


ne-1,¢ = 
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11.5 Lie algebra Cy = sp(2¢): 
Let us consider a set of anti-symmetric constants 
jk = —€kj, jk =1,2,--- , 20. (11.218) 
Then, the Lie algebra sp(2@) is defined by the generators Jj, satisfying 
(1) Jjn = Jey, 
(it) [Jjn, Jem] = €jeJkm + €emJje + €jmIne + €eeTjm, —_—- (11.219) 


where j,k, @,m =1,2,--- , 28. 

However, the form of the algebra in (11.219) is not very conve- 
nient for the discussion of its representations. So, we change basis 
and define new generators in the Cartan basis 


v vy 1 } 
SUN = SE = S(T — Iurev+e) — USnv+e + Iuter)) 
1 2 
Suv = 9 (Suv — Iutev+e) + Invte t+ Inter)» 


= 


1 . 
v=5 (Se + Fusevre) bts pyre — Jutep))s (11.220) 


where p,v = 1,2,--- , 2. Furthermore, let us choose the anti-symmetric 
constants, €j;x, of the form 


Exptl = —Evte = 1, €j,=0, otherwise. (11.221) 


With these, we can now calculate the commutation relations in the 
new basis using (11.219) which leads to 


KG XG SOM pO UKS, 
X*,, Sap] = — 6", Sip — Og Sew, 
XE S| HO See ore, 


Spv, S| = —69,X8, — 6% XP — 68 X% — 5X4, 


Sia Saak=0= |S" 9°], (11.222) 


where p,v,a,8 =1,2,--- ,2. 
Let us next define 


1 
Ge. ise eG. (11.223) 


1 
Be ik. Ie= 
i a ae 
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It follows now from (11.222) that these three generators define a su(2) 
Lie algebra given by 


[J3, Jt] =+J4, [J+, J-] = 2Js, (11.224) 


so that the eigenvalues of X}, are all integers. Similarly, we can 
show that the eigenvalues of each of the generators H,, = X‘',, = 
1,2,--- ,@ are all integers. We can now define the highest weight 
vector da, to satisfy 


Hyon = X" on = fuda, w=1,2,--- ,2, (19.225) 
with the (integer) eigenvalues satisfying 


fie i TU, (11.226) 


To obtain the root system, we again introduce an ¢-dimensional 
Euclidean space spanned by the orthonormal vectors e),€2,--- , eg. 
We note that we can identify the fundamental raising operators of 
the system to consist of 


Bay = Fis = Pee 
(11.227) 


which can be seen (as in (11.205)) to generate all the other raising 
operators X", with  < v as well as 5sKy . Here we have identified 
the simple roots with 


a1 =e, —€2, 2 = €2—€3, *** , Aeg_1 = €¢_1 — &f, 
ay = Dey. (11.228) 
This allows us to write the fundamental weight system as 
Ay=e;, Ag=e,+e.,, Ag y=e1+eg+---+ee-1, 
Ag =e, +eg+---+e¢-1+ 2, (11.229) 


so that the highest weight vector can be written in this case in the 
form (see also (11.211)) 


A= fier + foes + fge3 +--+ + free 
= \, Ay + AgAg + AgAg +--+ + AgAg, (11.230) 
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with 
fr = ALH+AQ+ AR H+6°° + Ag-1 + Ae, 
fo = A2 + ARB +++: + Ag-1 + re, 
(11.231) 
fei = re—1 + re, 


fe = re, 


Correspondingly, the Dynkin diagram for Cy = sp(2¢) is shown in 
Fig. 11.6. 


Figure 11.6: The Dynkin diagram for the simple roots of Ce = sp(2¢). 
Let us next define 
Th as oe et 8 ar el eA ee 2 (11.232) 


With this, Weyl’s formula for the dimension of a representation (com- 
pare with (11.215)) is given by 


£ £ 


1 
d(p) = (1!)(3!) ee ((20 = 1)!) Ul On I] ((o,)? | (o)") . 


=1 p< 


(11.233) 


Moreover the eigenvalues of the Casimir invariants, in a given repre- 
sentation, can also be computed to have the forms 
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L 
Io(p) = (20 + £44)(22 + £48) DS (Cn) (0)°) 
pw=1 
£ £ 
EON le) ea) Ves 
= v=1 


p=1 


= S7(0(0)? ( 
H=1 : 


sro?) |, 


+ 14(2é? + 2-1) 


(0 
v=1 
e 3 3 
Sin a 
p=1 


p=1 
(11.234) 
where we have identified (see (11.232)) 
0) =l-ptl. (11.235) 


We note that these formulae are exactly the same as for the Casimir 
invariants in the tensor representations of By = so(2@+ 1) (see dis- 
cussion around (11.214)) if we let 


(04,0) + (on - 5,01 — 5 
n (11.234). The reason for this is not well understood. 

Finally, we note that for the case of C2 = sp(4), the Dynkin 
diagram in Fig. 11.6 is equivalent to that in Fig. 11.5 for By = so(5) 
so that we must have Cy = Bg. Therefore, in discussions of Ce, one 
normally restricts to € > 3. The explicit equivalence of sp(4) and 
so(5) can be shown as follows. Let XL, Sip = Sup, S4” = S”*, wv = 
1,2, be the generators of sp(4). Let us define a new basis given by 


), (11.236) 


~ 1 ~ 1 
AS 5 (Xi +X%), X%= 5 (X2 — X}), 


s 1 + 1 

X= 59", M5 ots 

2s 2 1 7s " 1 

RY =—R = so Rig = —Ra = — 5522, 

Bi=S?, By=Sp, B?=X*%, Bo =X. (11.237) 


Then, it can be verified that the generators eae Rey = — Re Ruy = = 
yes Be and Bi, for u,v = 1,2 relaize the commutation relations 
of the Lie algebra By = so(5) given in (11.137) and (11.203). 
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11.6 Exceptional Lie algebras 


In addition to these classical Lie algebras, there are also Lie alge- 
bras known as exceptional Lie algebras conventionally known as the 
Go, Fy, Ee, E7 and Eg. We note here that the subscripts in the case 
of exceptional algebras correspond to their ranks (or equivalently, the 
number of simple roots). The standard manner of treating these ex- 
ceptional algebras is through the Cartan-Dynkin method discussed 
in section 11.1. However, we will discuss them in the following way. 

Let L denote any Lie algebra with a simple or semi-simple Lie 
sub-algebra Lo, namely, 


ero tos (esther (11.238) 


We can interpret (11.238) as follows. We can think of L as a vector 
space and denote the action of Zo on L through the relation 


a(x) =|a,z], for a€Llo, cxEL. (11.239) 


This, then, identifies L as a representation space of Lo. Furthermore, 
since we have assumed Lp to be simple or semi-simple and since 
any arbitrary representation of Lo is fully reducible by the Peter- 
Wey] theorem (see section 2.4), we can decompose the representation 
vector space L as a direct sum of irreducible components of Lo as 


L=lbhe)_\, (11.240) 


v7 


where V; denote the irreducible components of the representations of 
Lo so that 


[Lo, Vi] C Vi. (11.241) 


Furthermore, since L is a Lie algebra ({L, L] C L), it follows from the 
decomposition (11.240) that 


[Vis Vjl=aagt > — BighVer ag € Lo, (11.242) 
k 


for some constant coefficients 2;;, 3;;,.. The commutation relations 
of any Lie algebra LZ can be written in this way. Here we apply this 
method to exceptional Lie algebras since their Lie sub-algebras have 
been well studied in mathematical literature (for example, see the 
book by McKay, Patera and Rand). 
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11.6.1 G,. As a vector space, the Lie algebra LD = G2 is known to be 
14 dimensional. Furthermore, it is known that G2 is known to have a 
maximal Lie sub-algebra Lo = Ag = su(3) such that L = G2 can be 
reduced to a direct sum of irreducible representations of Lg = Ao = 
su(3) as 


14=84+3+43. (11.243) 


As we have seen, the octet is the adjoint (tensor) representation of 
su(3) while the other two correspond to the fundamental represen- 
tation and its conjugate representation. As a result, symbolically we 
may write 


Go={AH} @{A,} @{A4}, wy =1,2,3. (11.244) 


Explicitly A), A, and A“ satisfy the su(3) relations (see, for example, 
(11.88)) 


3 

ys At, = 9, 

p=1 

[As A%s| = 5 ANs a HAY, 

1 
[AY,, Ay] = —d"\ AL + 3 OK Ay, 
a 

[A#,, A*] = 3" AP — 3 5H AY, (11.245) 

Let us next consider the commutator [A,,A,]. As far as its 
tensor property is concerned, we can think of the commutator as 


coming from the direct product 3 @ 3 of su(3). On the other hand, 
let us recall the decomposition (see (5.93)) 


ileal) eo ® (11.246) 


for any su(NV). For the present case, the horizontal blocks (the first 
term) on the right hand side represent (2,0,0) = 6 while the vertical 
blocks (the second term) correspond to (1, 1,0) = (0,0, -1) = 3, com- 
ing from the decomposition 3 @ 3 = 6 + 3 (see, for example, (5.106)). 
However, for the present problem where L = Go, the exceptional al- 
gebra G2 does not contain a 6-dimensional representation of su(3) 
(see (11.243)). It follows, therefore, that we can write 


3 
[Ap, Av] =-2 ew A*, wv =1,2,3, (11.247) 
wed 
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where €,,,, denotes the three dimensional Levi-Civita tensor. We also 
note here that the multiplicative constant (—2) on the right hand side 
in (11.247) has been chosen for later convenience (the normalization 
is not relevant). Similarly, we can write 


3 
[A¥, A”] = 257 Ay, (11.248) 
A=1 


Finally, we must also determine the commutator [A“, A,]. We note 
that with respect to su(3), this commutator has the behavior coming 
from the decomposition 


3@3=861. (11.249) 


However, Gz does not contain a singlet component of su(3) (see 
(11.243)) so that we must have 


[AY A,] =3A", uv =1,2,3. (11.250) 


We note here that the multiplicative constant (3) on the right hand 
side is not arbitrary, rather it is determined from the Jacobi identity 
which must be satisfied by the commutators. This completes the 
construction of the Lie algebra Go. 

Carrying out the analysis as in earlier sections, we can show that 
the Dynkin diagram for G2 takes the simple form shown in Fig. 11.7. 
(Recall that su(3) has rank @ = 2 and, therefore, there are only two 
simple roots.) 


Figure 11.7: The Dynkin diagram for the simple roots of G9. 


For Go, if we identify 
3 42 2 
J3 = 5 Arg; dS AT, od = Ap, (11.251) 


then we can verify (see (11.245) as well as (11.250)) that they satisfy 
the su(2) algebra 


[Js, Jz] =+J4, [J,, J_] = 2s. (11.252) 
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As a result, we may choose the two generators of the Cartan algebra 
as 


Hi SA HAD), a ars (11.253) 


as well as the fundamental raising and lowering operators to be 


3 fil 
Tee = fe Ary, Fay = ‘E A’, 
3 49 1 
E_o, = 9 A 1 E_ay = 9 Ap. (11.254) 


We can now calculate directly and conclude (see (11.32)) that 


Hop Pe) =o Les leading to (a 1,a1) = 8, 
Oc re oye = Eas, leading to (Qo, a2 = 2s 
3 3 
Ho, , Eag| = = ss leading to (a ,a2) = 5s 
3 
ens Fo, — 9 Fox, 
| eee Ome ee | [Eom Bon] = Hoy: (11.255) 


This explains the reason for the peculiar Dynkin diagram for G2 in 
Fig. 11.7. 

The general irreducible representation of G2 is described by two 
non-negative integers 1, A9 > 0. We define 


1 4 1 
m=AMt srt e=OArt)) t+ ZO24 J), 


3 
1 1 1 
= — —-_= 1 
o2 grata 3 Az + J 
2 5 2 
03 =—-4 3 3 _ (Ay + 1) _— 3 (2 + Ly (11.256) 


so that 01 > 02 > 03 and oj +02 +03 =0. With these, the formula 
for the dimension of a representation is given by 


da) = = 010203(01 — 02)(a2 — 03)(01 — 03) 


9 1 


x (Ai + es + Ag+ 2)(2A1 + Ag+ 3) 
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= 720 2G +1)(A2 + 1) (1 + Ag + 2)(2A1 + A2 + 3) 


x (3A1 + A2 + 4)(3A1 + 2A2 + 5). (11.257) 


The second and the sixth order Casimir invariants can also be written 
as 


13(0) = (on)? > 
p=1 
a z "ogg 
I6(o) = -3 S (oyu) +10 | S(o,)? | + mar (11.258) 
H=1 y=1 


We recall here that G2 does not have any third, fourth and fifth 
order Casimir invariants (see discussion in the paragraph following 
(10.101)). The only nontrivial invariants are [2, Jg given in (11.258). 


11.6.2 F,. The exceptional Lie algebra L = Fy is known to be 52- 
dimensional and contains the 36-dimensional Lie sub-algebra Lo = 
By, = so(9). As the representation space of so(9), it decomposes as 


Fy = so(9)@ Ag, equivalently, 52 = 364 16, (11.259) 


where A, represents the 16-dimensional spinor representation of so(9). 
Let Xj, denote the generators of so(9) satisfying (see (11.134)) 


[Xj%, Xam] = —OjeFem — SkmJge + OjmIee + Sret jm, (11.260) 


with j,k,@,m = 1,2,---,9. Let wy, = 1,2,--- ,16 represent the 
spinor of so(9). Then, the rest of the commutation relations of Fy 
are given by 


[Xjx, Yu] = ye. (jk) vu? 


[Dy Ww] = 2 3 (Co5x) 4 X5k (11.261) 
j,k=1 
for j,k = 1,2,--- ,9 and p,yv = 1,2,---,16. Here oj, denote the 
generators of so(9) (see (6.103) where it is labelled J,,,) 


1 
5 v5 Yel, (11.262) 


Le 7 
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in terms of the generators of Clifford algebra 7; satisfying (see (6.1)) 


Furthermore, C’ represents the charge conjugation matrix associated 
with the Clifford algebra in 9-dimensions satisfying (see (6.70) as well 
as (6.72)) 


CC aay CPSC, (11.264) 


with T’ denoting the transpose of the matrix. The validity of the 
Jacobi identity 


[Xjes [bus Pr]] + [bys re, Xga]] + [vs [Xe Yul] = 0, (11.265) 


is guaranteed by the anti-symmetry under charge conjugation 


(CojK) uy = — (Cozi (11.266) 
while 
[bus fos Pall + fev [bas dull + fdas ys Yell = 0, (11.267) 
follows from the identity 
9 
DE U(Cojp)wn(aje)ar + (CojnurlOjx)ay + (Co;x)u(7jx)av] = 0, 
= (11.268) 


for u,v,a,\ = 1,2,--- ,16. We note here that the identity (11.268) 
can be obtained from the Fierz identity 


9 


2 S70 (Cojk) wok )ar 


j,k=1 


9 
=e ' (Cyj) AV (V5) — (Cy )apl¥pev] +3 (Cie oa Cy 0av) ; 
j=l 
(11.269) 


which, in turn, comes from the ortho-normal relations of the y; ma- 
trices. To conclude this discussion, we simply note that the Dynkin 
diagram for Fy is given by Fig. 11.8. (Recall that By = so(9) is a 
rank ¢ = 4 algebra.) 
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Figure 11.8: The Dynkin diagram for the simple roots of F4. 


11.6.3 Es. The exceptional Lie algebra L = Eg is known to be 78- 
dimensional and contains Lo = A; ®@A5 = su(2)®su(6) as its maximal 
Lie sub-algebra. Therefore, as a representation space, Eg decomposes 
as 


78 =18354+3@14+2@20=35+3+ 40, (11.270) 


and it can be thought of as being spanned by the generators X j pprk= 
1,2,--- ,6, BY, u,v = 1,2 and Ajxe,, where 


if x C As = su(6) and belongs to the space 1 ® 35 in (11.270). 
These generators satisfy the relations (see (11.88)) 


POX ROEM in = Orie (11.271) 
with j,k,@,m=1,2,--- ,6. 


2. BY, c Ai = su(2) and belongs to the space 3 @ 1 in (11.270). 
These generators satisfy the relations (see (11.88)) 


2 
ys BY, = 0, 
p=1 
[BY,, Bs] = 6°, B, — 6',BY, (11.272) 
with p,v,a,6 = 1,2. 


3. the tensor Ajx¢,,, is totally antisymmetric in j,k,¢=1,2,--- ,6 
for any 4 = 1,2 and belongs to the space 2 ® 20 in (11.270). 
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The rest of the commutation relations between the generators 
are given by 


1 
[BY Ajneal = —O Ajney + 5 OM AjKeds 
eae Atmn,] = =i Abenfiy = On, Abknps Fy 0, Atmk,yu 


A eee 
ar 9 0 Abie is 


2 


[Avgens Actes | == epkeake So (euaB, + €p,B",) 
=i 


6 
Pp 
— Ew y (€jreabpX”. + €jkeapeX’y, + €jkepbeX”, 
p=l 


—€;kpabeX y = EjplabeX 4, =< EpkbaboX 4 ’ (11.273) 
a 


for u,v = 1,2 and j,k, f,a,b,c = 1,2,--- ,6. Here €,,, denotes the two 
dimensional totally anti-symmetric Levi-Civita tensor with «jg = 1 
while €jx¢abe Stands for the six dimensional totally anti-symmetric 
Levi-Civita tensor. We note here that the Jacobi identity 


[Aate,r, [Ajke,., Adef,v]] + cyclic sum = 0, (11.274) 
holds because 

ewAsnex + GrAgne + OpAmen = 0, (11.275) 
which reflects the fact that there cannot be a third rank anti-symmetric 


tensor in a two dimensional space. We conclude this discussion by 
noting that the second order Casimir invariant, in this case, is given 


by 


6 2 
flo) = 3 X54 + BBY 
jk=1 bv=l 


2 6 
1 
Wis Se s eli efitabe A oy Anders (11.276) 
Hyv=1 j,k,l,a,b,c=1 


and that the Dynkin diagram for Eg has the form shown in Fig. 11.9. 
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a3 
O —— = 
Qa) a2 a4 a5 
S) 
a6 


Figure 11.9: The Dynkin diagram for the simple roots of Eg. 


11.6.4 E,. The exceptional Lie algebra L = E7 is known to be 133- 
dimensional and has a maximal Lie sub-algebra Lo = A7 = su(8). 
Therefore, as a representation space, we can decompose E7 as 


133 = 63+ 70, (11.277) 


where the 63-dimensional representation can be identified with the 
adjoint representation X’,, while the 70-dimensional representation 
corresponds to the totally anti-symmetric tensor Ajx¢m with j,k, ¢,m = 
1,2,---,8. The explicit commutation relations for these generators 
are given by (see (11.88) and note that the other commutation rela- 
tions are obtained from the transformation of tensors discussed, say, 


in (3.92)) 


8 . 
DS 
> x7; =0, 
j=l 
Xe Re SO =O XG. 
ix Atmnp| = <O Apoins — 62, Acknp a 6, Aemkp = 63 Atmnk 


Tee 
or 9 ok Aemnp: 


8 
: 8 8 
[Ajkem, Aner = lsktowepan j a €jsémnpqrX k 


s=1 


s 8 s 
ae Ejksmnpqr L ag €jkesnpar-X prea €jkemspqr & n 


ale EjklmnsqrX p ar €jklmnpsrX gq ar Sninieied | , (11.278) 


where €jk¢mnpgr denotes the totally anti-symmetric Levi-Civita tensor 
in 8-dimensions. The Dynkin diagram for £7 is shown in Fig. 11.10. 
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Figure 11.10: The Dynkin diagram for the simple roots of F7. 


11.6.5 Ey. The exceptional Lie algebra LZ = Eg is known to be 248- 
dimensional and contains as maximal Lie sub-algebra Lp = Dg = 
so(16). Correspondingly, Eg decomposes as a representation space as 


248 = 120 + 128, (11.279) 


in terms of the 120-dimensional adjoint representation X;,, j,k = 
1,2,--- ,16 and the 2’ = 128 dimensional (irreducible) spinor rep- 
resentation ~,,,)6 = 1,2,--+ ,128 (which can be a Weyl/Majorana 
spinor). 

The commutation relations between Xj, and yw, are given by 
(see also (11.260) and (11.261)) 


16 ; 
S230 = 0; 
j=l 


X jk, Xtm] = —55eXkm — OemX je + SjmXke + OkeX jm; 


128 


X jk; Wu] = S° Wy (7k) yy ’ 
v=1 


16 
Darby) = 2S) (Cog), Xan: (11.280) 
jk=1 
Here j,k, ¢,m = 1,2,--- ,16 while p,v = 1,2,--- , 128. Furthermore, 
oj~ (and oj,) are defined as (see discussions from (6.100)-(6.106) 


where oj, and oj, are labelled as ID and Ie respectively) 


ih 
Ojk = (o;0}, ax oxo!) 5 


4 
1 
4 


Tix = = (olen — oho), (11.281) 
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in terms of the 128 x 128 dimensional matrices o; satisfying 
ag ee een ee ti 
O50, + OKO; = 26;~ 1 = T;Ok + O40), (11.282) 
which define the generators of the Clifford algebra y; as in (6.101) 


0 a; 
y= ( ; ; (11.283) 
o, O 


J 


Moreover, we note that C in (11.280) denotes the 128-dimensional 
charge conjugation matrix defined by 


cs 
(Oy Cog. = (ct) ; 
(i) ColC=(a;)", 
(wi) CT =C. (11.284) 
And, as in (11.265)-(11.266), here also the Jacobi identity 


[Xjr: [Pps Pe] eo [Pus [bv X;x]] + [wr [Xje, Yul] = 0, (11.285) 


holds because of the anti-symmetry under charge conjugation 


(CoxK) wy = — (Copk) yy (11.286) 
while the Jacobi identity 
[Dus [bys Pal] + (dv, [as Yul] + [das Pu, dr] = 0, (11.287) 
Pore WISE on 4 108 Pllowe trom the identity, 
16 
So [(Cojn) wr (oir ar + (Coze)ua(oje)op + (Cojk)ru(oje)av] = 0. 
— (11.288) 


As we have pointed out in (11.268), this identity results from 
the orthonormality relations for the 7; matrices in even dimensions. 
In fact, let us consider a slightly more general case of so(N) Clifford 
algebra with 


d=2”, for N=2M +1, 
¢=o! 1 for N=2M, (11.289) 
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denoting the dimensions of the oj; matrices. Let us suppose that 


N 

S 2 [(Coje) ww (ojr ar + (Coje)va(ojk)op + (Cojk)ru(7jr)ov] = 0. 
ik=1 
(11.290) 


for u,v,’ = 1,2,---,d. This condition can be checked as follows to 
be satisfied only for three cases. Let us multiply (11.290) by (gem) yo 
and sum over the indices A, a. This leads to 


N 
Yo [(C oie) py TH O52Tem) + (Cojo em Aik) yp (COjR ome jk) | = 9 
jk=l 
(11.291) 


where we have used the anti-symmetry in (11.286). Furthermore, 
using the identities 


N 

1 
) OjkOlmOjk = rica ON 4 16) Om, 
jk=1 


Tr (0j17%m) = —5 4 (Sy0bbm — Bidet) (11.292) 
in (11.291), we obtain 

N?-9N+4+16=d. (11.293) 
This has only three solutions given by 


(i) N=16, M=8, asin Eg > so(16), 
(i) N=9, M=4, asin Fy - s0(9), 
(ii) N=8, M=4, _ for so(9) > so(8). (11.294) 


We conclude the discussion on Eg by noting that the Dynkin diagram 
for the simple roots is shown in Fig. 11.11. 

We conclude this section on exceptional Lie algebras by not- 
ing that there are several ways to construct these algebras, espe- 
cially in terms of the octonion algebras. Let us mention this briefly 
here. Let A be an algebra (which may be non-associative) with a 
symmetric bilinear non-degenerate form (z|y) = (y|x). Here, non- 
degeneracy simply implies that if (a|z) = 0 for all « € A, then 
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a ae: 
O 


as 


Figure 11.11: The Dynkin diagram for the simple roots of Eg. 


a = 0. Furthermore, let us suppose that we have the composition 
law (xy|xy) = (x|x) (y|y). Then A is called a composition algebra. If 
we assume that A, in addition, is unital, namely, it possesses the unit 
element e, then A is known as a unital composition algebra or sim- 
ply a Hurwitz algebra. In this case, the Hurwitz theorem states that 
there are only four possibilities for such an algebra of dimensionalities 
1, 2,4,8, corresponding respectively to the real and complex number 
fields, quarternion and octonion algebras. 

The classical construction of the exceptional Lie algebras Fy, Ee, 
E7 and Eg is due to Tits (see the book by Schafer), where one uses 
both Hurwitz algebra together with Jordan algebra consisting of 3 x 3 
matrices over the Hurwitz algebra. This leads to Freudenthal’s magic 
square. Another more symmetric approach is due to Elduque which 
utilizes two independent Hurwitz algebras to obtain essentially the 
same result. The method also uses the triality relations associated 
with Hurwitz algebras. However, these are quite technical and we 
will not go into the details of this. 
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